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5 MAMMALIAN NEURO- GROWTH FACTOR- LIKE PROTEIN 

BACKGROUND OF THE INVENTION 

Proliferation and differentiation of cells of 
10 multicellular organisms are controlled by hormones and 
polypeptide growth factors. These diff usable molecules 
allow cells to communicate with each other and act in 
concert to form cells and organs, and to repair and 
regenerate damaged tissue. Examples of hormones and 
15 growth factors include the steroid hormones (e.g. 

estrogen, testosterone), parathyroid hormone, follicle 
stimulating hormone, the interleukins , platelet derived 
growth factor (PDGF) , epidermal growth factor (EGF) , 
granulocyte -macrophage colony stimulating factor (GM-CSF) , 
20 erythropoietin (EPO) and calcitonin. 

Hormones and growth factors influence cellular 
metabolism by binding to proteins. Proteins may be 
integral membrane proteins that are linked to signaling 
25 pathways within the cell, such as second messenger 

systems. Other classes of proteins are soluble molecules, 
such as the transcription factors. 

SUMMARY OF THE INVENTION 

30 

The present invention addresses this need by 
providing a novel neuro-growth factor like polypeptide 
called Zneul and related compositions and methods. Within 
.one aspect, the present invention provides an isolated 
35 polynucleotide encoding a mammalian polypeptide termed 

Zneul. The mature human Zneul polypeptide is comprised of 
a sequence of amino acids approximately 254 amino acids 
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long. Amino acid residue 2 0 of SEQ ID NO: 2, a threonine, 
is the initial amino acid of the mature polypeptide. Thus, 
it is believed that amino residues 1-19 comprise a signal 
sequence, and the mature Zneul polypeptide is represented 
5 by the amino acid sequence comprised of residues 20-254. 
The mature Zneul polypeptide is further represented by SEQ 
ID NO: 3. Mouse Zneul is defined by SEQ ID NOs:18 and 19. 
Having a signal sequence of amino acid residues 1-23, and 
the mature mouse Zneul is from 24-278 represented by SEQ 
10 ID NO: 24. Within an additional embodiment, the 

polypeptide further comprises an affinity tag. Within a 
further embodiment , the polynucleotide is DNA. 

Within a second aspect of the invention there is 
15 provided an expression vector comprising (a) a 

transcription promoter; (b) a DNA segment encoding Zneul 
polypeptide, and (c) a transcription terminator, wherein 
the promoter, DNA segment, and terminator are operably 
linked. 

20 

Within a third aspect of the invention there is 
provided a cultured eukaryotic cell into which has been 
introduced an expression vector as disclosed above, 
wherein said cell expresses a protein polypeptide encoded 

2 5 by the DNA segment. 

Within a further aspect of the invention there 
is provided a chimeric polypeptide consisting essentially 
of a first portion and a second portion joined by a 

3 0 peptide bond. The first portion of the chimeric 

polypeptide consists essentially of (a) a Zneul 
polypeptide as shown in SEQ ID NO: 2 (b) allelic variants 
of SEQ ID NO: 2; and (c) protein polypeptides that are at 
least 90% identical to (a) or (b) . The second portion of 
35 the chimeric polypeptide consists essentially of another 
polypeptide such as an affinity tag. Within one 
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embodiment the affinity tag is an immunoglobulin F c 
polypeptide. The invention also provides expression 
vectors encoding the chimeric polypeptides and host cells 
transfected to produce the chimeric polypeptides. 

5 

Within an additional aspect of the invention 
there is provided an antibody that specifically binds to a 
Zneul polypeptide as disclosed above, and also an anti- 
idiotype antibody which neutralizes the antibody to a 
10 Zneul polypeptide. 



15 



In addition to the above, the present invention 
is also directed domains of the polypeptide including SEQ 
ID N0s:8, 9, 10, 11, 12, 13, 14, 15, and 16. 



An additional embodiment of the present 
invention relates to a peptide or polypeptide which has 
the amino acid sequence of an epitope-bearing portion of a 
Zneul polypeptide having an amino acid sequence described 

20 above. Peptides or polypeptides having the amino acid 
sequence of an epitope-bearing portion of a Zneul 
polypeptide of the present invention include portions of 
such polypeptides with at least nine, preferably at least 
15 and more preferably at least 3 0 to 50 amino acids, 

25 although epitope-bearing polypeptides of any length up to 
and including the entire amino acid sequence of a 
polypeptide of the present invention described above are 
also included in the present invention. Specific examples 
of said polypeptides are defined by the amino acid 

30 sequences of SEQ ID NOs : 20-23. Also claimed are any of 

these polypeptides that are fused to another polypeptide 
or carrier molecule. 

Another embodiment of the present invention 
35 relates to a method for producing an antibody which binds " 
to a peptide or polypeptide defined by SEQ ID NOs: 2-3,8, 
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9, 11-16, and 19-24 or to a peptide or polypeptide which 
is at least 90% identical to said peptide or polypeptide 
comprising inoculating an animal with said peptide or 
polypeptide or with a nucleic acid which encodes said 
5 peptide or polypeptide, wherein said animal produces 

antibodies to said peptide or polypeptide; and isolating 
said antibody. 



10 

These and other aspects of the invention will 
become evident upon reference to the following detailed 
description . 

15 

DETAILED DESCRIPTION OF THE INVENTION 

The teachings of all of the references cited 
herein are incorporated in their entirety by reference. 

20 

The term "allelic variant" is used herein to 
denote any of two or more alternative forms of a gene 
occupying the same chromosomal locus. Allelic variation 
arises naturally through mutation, and may result in 

25 phenotypic polymorphism within populations. Gene 
mutations can be silent (no change in the encoded 
polypeptide) or may encode polypeptides having altered 
amino acid sequence. The term allelic variant is also 
used herein to denote a protein encoded by an allelic 

3 0 variant of a gene. 

The term "expression vector" is used to denote a 
DNA molecule, linear or circular, that comprises a segment 
encoding a polypeptide of interest operably linked to 
35 additional segments that provide for its transcription. 
Such additional segments include promoter and terminator 



3NSDOCID: <WO 9857983A2_I_> 



WO 98/57983 PCT/US98/12763 



sequences, and may also include one or more origins of 
replication, one or more selectable markers, an enhancer, 
a polyadenylation signal, etc. Expression vectors are 
generally derived from plasmid or viral DNA, or may 
5 contain elements of both. 

The term "isolated", when applied to a 
polynucleotide, denotes that the polynucleotide has been 
removed from its natural genetic milieu and is thus free 
10 of other extraneous or unwanted coding sequences, and is 
in a form suitable for use within genetically engineered 
protein production systems. 

"Operably linked", when referring to DNA 
15 segments, indicates that the segments are arranged so that 
they function in concert for their intended purposes, e.gr. 
transcription initiates in the promoter and proceeds 
through the coding segment to the terminator. 

20 A "polynucleotide" is a single- or double- 

stranded polymer of deoxyribonucleot ide or ribonucleotide 
bases read from the 5' to the 3 1 end. Polynucleotides 
include RNA and DNA, and may be isolated from natural 
sources, synthesized in vitro, or prepared from a 

25 combination of natural and synthetic molecules. 

The term "promoter" is used herein for its art- 
recognized meaning to denote a portion of a gene 
containing DNA sequences that provide for the binding of 
30 RNA polymerase and initiation of transcription. Promoter 
sequences are commonly, but not always, found in the 5' 
non-coding regions of genes. 

3 5 A "soluble protein" is a protein polypeptide 

that is not bound to a cell membrane. 
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Within preferred embodiments of the invention 
the isolated polynucleotides will hybridize to similar 
5 sized regions of SEQ ID NO:l, or a sequence complementary 
thereto, under stringent conditions. In general, 
stringent conditions are selected to be about 5°C lower 
than the thermal melting point (T m ) for the specific 
sequence at a defined ionic strength and pH . The T m is 

10 the temperature (under defined ionic strength and pH) at 

which 50% of the target sequence hybridizes to a perfectly 
matched probe. Typical stringent conditions are those in 
which the salt concentration is about 0.02 M or less at pH 
7 and the temperature is at least about 60°C. As 

15 previously noted, the isolated polynucleotides of the 
present invention include DNA and RNA. Methods for 
isolating DNA and RNA are well known in the art. Total RNA 
can be prepared using guanidine HC1 extraction followed by 
isolation by centrif ugation in a CsCl gradient, Chirgwin 

20 et al., Biochemistry 18:52-94 (1979). Poly (A) + RNA is 
prepared from total RNA using the method of Aviv and 
Leder, Proc. Natl. Acad. Sci. USA 63:1408-1412 (1972). 
Complementary DNA (cDNA) is prepared from poly (A) + RNA 
using known methods. Polynucleotides encoding Zneul 

25 polypeptides are then identified and isolated by, for 
example, hybridization or PCR. 

The polynucleotides of the present invention can 
be synthesized using DNA synthesizer. Currently the method 

3 0 of choice is the phosphoramidite method. If chemically 
synthesized double stranded DNA is required for an 
application such as the synthesis of a gene or a gene 
fragment, then each complementary strand is made 
separately. The production of short genes (60 to 80 bp) is 

35 technically straightforward and can be accomplished by 

synthesizing the complementary strands and then annealing 
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them. For the production of longer genes (>300 bp) , 
however, special strategies must be invoked, because the 
coupling efficiency of each cycle during chemical DNA 
synthesis is seldom 100%. To overcome this problem, 
5 synthetic genes (double-stranded) are assembled in modular 
form from single-stranded fragments that are from 20 to 
100 nucleotides in length. See Glick, Bernard R. and Jack 
J. Pasternak, Molecular Biotechnology, Principles & 
Applications of Recombinant DNA, (ASM Press, Washington, 
10 D.C. 1994), Itakura, K. et al . Synthesis and use of 

synthetic oligonucleotides. Annu . .Rev. Biochem. 53 : 323- 
356 (1984), and Climie, S. et al . Chemical synthesis of 
the thymidylate synthase gene. Proc . Natl. Acad. Sci . USA 
87 :633-637 (1990) . 

15 

Those skilled in the art will recognize that the 
sequences disclosed in SEQ ID NOS : 1 , 2 and 3 represent a . 
single allele of the human. Allelic variants of these 
sequences can be cloned by probing cDNA or genomic 
20 libraries from different individuals according to standard 
procedures . 

The present invention further provides 
counterpart proteins and polynucleotides from other 

25 species ("species orthologs"). Of particular interest are 
Zneul polypeptides from other mammalian species, including 
murine, porcine, ovine, bovine, canine, feline, equine, 
and other primates. Species orthologs of the human Zneul 
protein can be cloned using information and compositions 

3 0 provided by the present invention in combination with 

conventional cloning techniques. For example, a cDNA can 
be cloned using mRNA obtained from a tissue or cell type 
that expresses the protein. Suitable sources of mRNA can 
be identified by probing Northern blots with probes 

35 designed from the sequences disclosed herein. A library 
is then prepared from mRNA of a positive tissue or cell 
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line. A protein-encoding cDNA can then be isolated by a 
variety of methods, such as by probing with a complete or 
partial human or mouse cDNA or with one or more sets of 
degenerate probes based on the disclosed sequences. A 
5 cDNA can also be cloned using the polymerase chain 

reaction, or PCR (Mullis, U.S. Patent No. 4,683,202), 
using primers designed from the sequences disclosed 
herein. Within an additional method, the cDNA library can 
be used to transform or transfect host cells, and 

10 expression of the cDNA of interest can be detected with an 
antibody to the protein. Similar techniques can also be 
applied to the isolation of genomic clones. As used and 
claimed the language "an isolated polynucleotide which 
encodes a polypeptide, said polynucleotide being defined 

15 by SEQ ID NO: 2" includes all allelic variants and species 
orthologs of the polypeptide of SEQ ID NO : 2 . 

The present invention also provides isolated 
protein polypeptides that are substantially homologous to 

20 the polypeptide of SEQ ID NO: 3 and its species orthologs. 
By "isolated" is meant a protein or polypeptide that is 
found in a condition other than its native environment, 
such as apart from blood and animal tissue. In a 
preferred form, the isolated polypeptide is substantially 

25 free of other polypeptides, particularly other 

polypeptides of animal origin. It is preferred to provide 
the polypeptides in a highly purified form, i.e. greater 
than 95% pure, more preferably greater than 99% pure. The 
term "substantially homologous" is used herein to denote 

30 polypeptides having 50%, preferably 6 0%, more preferably 
at least 80%, sequence identity to the sequence shown in 
SEQ ID NO: 2, or its species orthologs. Such polypeptides 
will more preferably be at least 90% identical, and most 
preferably 95% or more identical to SEQ ID NO: 3, or its 

35 species orthologs. Percent sequence identity is 

determined by conventional methods. See, for example, 
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Altschul et al., Bull. Math. Bio. 48: 603-616 (1986) and 
Henikoff and Henikoff, Proc . Natl. Acad. Sci. USA 
89:10915-10919 (1992). Briefly, two amino acid sequences 
are aligned to optimize the alignment scores using a gap 
5 opening penalty of 10, a gap extension penalty of 1, and 
the "blossom 62" scoring matrix of Henikoff and Henikoff 
(ibid.) as shown in Table 1 (amino acids are indicated by 
the standard one-letter codes) . The percent identity is 
then calculated as: 

10 Total number of identical matches 

. ; x 100 

[length of the longer sequence plus the 
number of gaps introduced into the longer 
sequence in order to align the two 

15 sequences] 
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Sequence identity of polynucleotide molecules is 
determined by similar methods using a ratio as disclosed 
above . 



polypeptides are characterized as having one or more amino 
acid substitutions, deletions or additions. These changes 
are preferably of a minor nature, that is conservative 
amino acid substitutions (see Table 2) and other 

10 substitutions that do not significantly affect the folding 
or activity of the protein or polypeptide; small 
deletions, typically of one to about 30 amino acids; and 
small amino- or carboxyl- terminal extensions, such as an 
amino- terminal methionine residue, a small linker peptide 

15 of up to about 20-25 residues, or a small extension that 
facilitates purification (an affinity tag) , such as a 
poly-histidine tract, protein A, Nilsson et al . , EMBO J. 
4:1075, (1985); Nilsson et al . , Methods Enzymol . 198:3, 
(1991), glutathione S transferase, Smith and Johnson, Gene 

20 67:31, (1988), or other antigenic epitope or binding 

domain. See, in general Ford et al . , Protein Expression 
and Purification 2: 95-107, (1991). DNAs encoding 
affinity tags are available from commercial suppliers 
(e.g., Pharmacia Biotech, Piscataway, NJ) . 

25 

Table 2 



5 



Substantially homologous proteins and 



Conservative amino acid substitutions 



Basic : 



argimne 
lysine 
histidine 
glutamic acid 
aspartic acid 
glutamine 
asparagine 
leucine 



30 



Acidic : 



Polar : 



35 



Hydrophobic : 



isoleucine 



> 9857983A2_C> 



WO 98/57983 



PCT/US98/12763 



12 



Table 2, continued 



Aromatic ; 



Small 



10 



valine 

phenylalanine 

tryptophan 

tyrosine 

glycine 

alanine 

serine 

threonine 

methionine 



Essential amino acids in the polypeptides of the 
present invention can be identified according to 
procedures known in the art, such as site -directed 

15 mutagenesis or alanine-scanning mutagenesis, Cunningham 
and Wells, Science 244, 1081-1085, (1989); Bass et al . , 
Proc. Natl. Acad. Sci . USA 88:44 98-4 502, (1991). In the 
latter technique, single alanine mutations are introduced 
at every residue in the molecule, and the resultant mutant 

20 molecules are tested for biological activity (e.g., ligand 
binding and signal transduction) to identify amino acid 
residues that are critical to the activity of the 
molecule. Sites of ligand-protein interaction can also be 
determined by analysis of crystal structure as determined 

25 by such techniques as nuclear magnetic resonance, 

crystallography or photoaf f inity labeling. See, for 
example, de Vos et al . , Science 255:306-312, (1992); Smith 
et al., J. Mol. Biol. 224:899-904, (1992); Wlodaver et 
al., FEBS Lett. 309:59-64, (1992). The identities of 

30 essential amino acids can also be inferred from analysis 
of homologies with related proteins . 



Multiple amino acid substitutions can be made 
and tested using known methods of mutagenesis and 
3 5 screening, such as those disclosed by Reidhaar-Olson and 
Sauer, Science 241:53-57 , (1988) or Bowie and Sauer, Proc. 



^NSDOCID: <WO 9857983A2_L> 



WO 98/57983 PCT/US98/12763 

13 

Natl. Acad. Sci. USA £6:2152-2156, (1989) . Briefly, these 
authors disclose methods for simultaneously randomizing 
two or more positions in a polypeptide, selecting for 
functional polypeptide, and then sequencing the 
5 mutagenized polypeptides to determine the spectrum of 

allowable substitutions at each position. Other methods 
that can be used include phage display, e.g., Lowman et 
al . , Biochem. 30:10832-10837, (1991); Ladner et al . , U.S. 
Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204, 
10 and region-directed mutagenesis, Derbyshire et al . , Gene 
46:145, (1986); Ner et al . , DNA 7:127, (1988) 

Mutagenesis methods as disclosed above can be 
combined with high- throughput screening methods to detect . 

15 activity of cloned, mutagenized proteins in host cells. 

Preferred assays in this regard include cell proliferation, 
assays and biosensor-based ligand-binding assays, which 
are described below. Mutagenized DNA molecules that encode 
active proteins or portions thereof (e.g., ligand-binding _ 

2 0 fragments) can be recovered from the host cells and 

rapidly sequenced using modern equipment . These methods 
allow the rapid determination of the importance of 
individual amino acid residues in a polypeptide of 
interest, and can be applied to polypeptides of unknown 

25 structure. 

Using the methods discussed above, one of 
ordinary skill in the art can prepare a variety of 
polypeptides that are substantially homologous to SEQ ID 
30 NO: 3 or allelic variants thereof and retain the properties 
of the wild- type protein. As expressed and claimed herein 
the language, "a polypeptide as defined by SEQ ID NO: 2 " 
includes all allelic variants and species orthologs of the 
polypeptide . 

35 - 
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The protein polypeptides of the present 
invention, including full-length proteins, protein 
fragments (e.g. ligand-binding fragments) , and fusion 
polypeptides can be produced in genetically engineered^ 
5 host cells according to conventional techniques. Suitable 
host cells are those cell types that can be transformed or 
transfected with exogenous DNA and grown in culture, and 
include bacteria, fungal cells, and cultured higher 
eukaryotic cells. Eukaryotic cells, particularly cultured 
10 cells of multicellular organisms, are preferred. 

Techniques for manipulating cloned DNA molecules and 
introducing exogenous DNA into a variety of host cells are 
disclosed by Sambrook et al . , Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory 
15 Press, Cold Spring Harbor, NY, (1989), and Ausubel et al . , 
ibid. 

In general, a DNA sequence encoding a Zneul 
polypeptide is operably linked to other genetic elements 
required for its expression, generally including a 
transcription promoter and terminator, within an 
expression vector. The vector will also commonly contain 
one or more selectable markers and one or more origins of 
replication, although those skilled in the art will 
recognize that within certain systems selectable markers 
may be provided on separate vectors, and replication of 
the exogenous DNA may be provided by integration into the 
host cell genome. Selection of promoters, terminators, 
selectable markers, vectors and other elements is a matter 
of routine design within the level of ordinary skill in 
the art. Many such elements are described in the 
literature and are available through commercial suppliers. 

Another embodiment of the present invention 
35 provides for a peptide or polypeptide comprising an 

epitope-bearing portion of a polypeptide of the invention. 
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The epitope of the this polypeptide portion is an 
immunogenic or antigenic epitope of a polypeptide of the 
invention. A region of a protein to which an antibody can 
bind is defined as an "antigenic epitope" . See for 
5 instance, Geysen, H.M. et al . , Proc . Natl. Acad Sci . USA 
82.-3998-4002 (1984) . 

As to the selection of peptides or polypeptides 
bearing an antigenic epitope (i.e., that contain a region 

10 of a protein molecule to which an antibody can bind) , it 
is well known in the art that relatively short synthetic 
peptides that mimic part of a protein sequence are 
routinely capable of eliciting an antiserum that reacts 
with the partially mimicked protein. See Sutcliffe, o.G. 

15 et al . Science 219:660-666 (1983). Peptides capable of 

eliciting protein-reactive sera are frequently represented 
in the primary sequence of a protein, can be characterized 
by a set of simple chemical rules, and are confined 
neither to immunodominant regions of intact proteins 

20 (i.e., immunogenic epitopes) nor to the amino or carboxyl 
terminals. Peptides that are extremely hydrophobic and 
those of six or fewer residues generally are ineffective 
at inducing antibodies that bind to the mimicked protein; 
longer soluble peptides, especially those containing 

25 proline residues, usually are effective. 

Antigenic epitope -bearing peptides and 
polypeptides of the invention are therefore useful, to 
raise antibodies, including monoclonal antibodies, that 

30 bind specifically to a polypeptide of the invention. 

Antigenic epitope-bearing peptides and polypeptides of the 
present invention contain a sequence of at least nine, 
preferably between 15 to about 3 0 amino acids contained 
within the amino acid sequence of a polypeptide of the 

35 invention. However, peptides or polypeptides comprising a 
larger portion of an amino acid sequence of the invention, 
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containing from 30 to 50 amino acids, or any length up to 
and including the entire amino acid sequence of a 
polypeptide of the invention, also are useful for inducing 
antibodies that react with the protein. Preferably, the 
5 amino acid sequence of the epitope -bearing peptide is 
selected to provide substantial solubility in aqueous 
solvents (i.e., the sequence includes relatively 
hydrophilic residues and hydrophobic residues are 
preferably avoided) ; and sequences containing proline 
10 residues are particularly preferred. All of the 

polypeptides shown in the sequence listing contain 
antigenic epitopes to be used according to the present 
invention, however, specifically designed antigenic 
epitopes include the peptides defined by SEQ ID NOs: 20-24. 

15 

Polynucleotides, generally a cDNA sequence, of 
the present invention encode the above -described 
polypeptides. A cDNA sequence which encodes a polypeptide 
of the present invention is comprised of a series of 
20 codons, each amino acid residue of the polypeptide being 

encoded by a codon and each codon being comprised of three 
nucleotides. The amino acid residues are encoded by their 
respective codons as follows. 

25 Alanine (Ala) is encoded by GCA, GCC, GCG or 

GCT; 

Cysteine (Cys) is encoded by TGC or TGT; 
Aspartic acid (Asp) is encoded by GAC or GAT 
Glutamic acid (Glu) is encoded by GAA or GAG 
30 Phenylalanine (Phe) is encoded by TTC or TTT 

Glycine (Gly) is encoded by GGA, GGC, GGG or 

GGT; 

Histidine (His) is encoded by CAC or CAT; 
Isoleucine (lie) is encoded by ATA, ATC or ATT; 
35 Lysine (Lys) is encoded by AAA, or AAG; 
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Leucine (Leu) is encoded by TTA, TTG, CTA, CTC, 
CTG or CTT; 

Methionine (Met) is encoded by ATG; 
Asparagine (Asn) is encoded by AAC or AAT; 
5 Proline (Pro) is encoded by CCA, CCC, CCG or 

CCT; , 

Glutamine (Gin) is encoded by CAA or CAG; 
Arginine (Arg) is encoded by AGA, AGG, CGA # CGC, 
CGG or CGT; 

10 Serine (Ser) is encoded by AGC , AGT , TCA, TCC, 

TCG or TCT; 

Threonine (Thr) is encoded by ACA, ACC, ACG or 

ACT; 

Valine (Val) is encoded by GTA, GTC, GTG or GTT; 
15 Tryptophan (Trp) is encoded by TGG; and 

Tyrosine (Tyr) is encoded by TAC or TAT. 

It is to be recognized that according to the 
present invention, when a cDNA is claimed as described 

20 above, it is understood that what is claimed are both the 
sense strand, the anti-sense strand, and the DNA as 
double -stranded having both the sense and ant i- sense 
strand annealed together by their respective hydrogen 
bonds. Also claimed is the messenger RNA (mRNA) which 

25 encodes the polypeptides of the present invention, and 
which mRNA is encoded by the above-described cDNA. A 
messenger RNA (mRNA) will encode a polypeptide using the 
same codons as those defined above, with the exception 
that each thymine (T) is replaced by a uracil nucleotide 

30 (U) . 

To direct a Zneul polypeptide into the secretory 
pathway of a host cell, a secretory signal sequence (also 
known as a leader sequence, prepro sequence or pre 
3 5 sequence) is provided in the expression vector. The 

secretory signal sequence may be that of the protein, or 
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may be derived from another secreted protein (e.g., t-PA) 
or synthesized de novo. The secretory signal sequence is 
joined to the Zneul DNA sequence in the correct reading 
frame. Secretory signal sequences are commonly positioned 
5 5 1 to the DNA sequence encoding the polypeptide of 
interest, although certain signal sequences may be 
positioned elsewhere in the DNA sequence of interest (see, 
e.g., Welch et al . , U.S. Patent No. 5,037,743; Holland et 
al., U.S. Patent No. 5,143,830). 

10 

Cultured mammalian cells are preferred hosts 
within the present invention. Methods for introducing 
exogenous DNA into mammalian host cells include calcium 
phosphate-mediated transf ection, Wigler et al . , Cell 

15 14:725, (1978); Corsaro and Pearson, Somatic Cell Genetics 
7:603, (1981) : Graham and Van der Eb, Virology 52 : 456 , 
(1973), electroporation, Neumann et al . , EMBO J. 1:841- 
845, (1982), DEAE-dextran mediated transf ection, Ausubel 
et a2 . , eds. , Current Protocols in Molecular Biology, John 

20 Wiley and Sons, Inc., NY, (1987), and liposome-mediated 
transf ection, Hawley-Nelson et al . , Focus 25:73, (1993); 
Ciccarone et al Focus 15:80, (1993). The production of 
recombinant polypeptides in cultured mammalian cells is 
disclosed, for example, by Levinson et al., U.S. Patent 

25 No. 4,713,339; Hagen et al . , U.S. Patent No. 4,784,950; 
Palmiter et al . , U.S. Patent No. 4,579,821; and Ringold, 
U.S. Patent No. 4,656,134. Suitable cultured mammalian 
cells include the COS-1 (ATCC No. CRL 1.650) , COS-7 (ATCC 
No. CRL 1651), BHK (ATCC No. CRL 1632), BHK 570 (ATCC No. 

30 CRL 10314), 293, ATCC No. CRL 1573; Graham et al . , J. Gen. 
Virol. 36:59-72, (1977) and Chinese hamster ovary (e.g. 
CHO-K1; ATCC No. CCL 61) cell lines. Additional suitable 
cell lines are known in the art and available from public 
depositories such as the American Type Culture Collection, 

35 Rockville, Maryland. In general, strong transcription 

promoters are preferred, such as promoters from SV-4 0 or 
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cytomegalovirus. See, e.g., U.S. Patent No. 4,956,288. 
Other suitable promoters include those from 
metallothionein genes (U.S. Patent Nos . 4,579,821 and 
4 , 601, 978 , and the adenovirus major late promoter. 

5 

Drug selection is generally used to select for 
cultured mammalian cells into which foreign DNA has been 
inserted. Such cells are commonly referred to as 
" transf ectants" . Cells that have been cultured in the 

10 presence of the selective agent and are able to pass the 
gene of interest to their progeny are referred to as 
"stable transf ectants . " A preferred selectable marker is 
a gene encoding resistance to the antibiotic neomycin. 
Selection is carried out in the presence of a neomycin- 

15 type drug, such as G-418 or the like. Selection systems 
may also be used to increase the expression level of the 
gene of interest, a process referred to as 
"amplification. " Amplification is carried out by 
culturing transf ectants in the presence of a low level of 

20 the selective agent and then increasing the amount of 
selective agent to select for cells that produce high 
levels of the products of the introduced genes. A 
preferred amplifiable selectable marker is dihydrof olate 
reductase, which confers resistance to methotrexate. 

25 Other drug resistance genes (e.g. hygromycin resistance, 
multi-drug resistance, puromycin acetyltransf erase) can 
also be used. 

Other higher eukaryotic cells can also be used 
30 as hosts, including insect cells, plant cells and avian 
cells. Transformation of insect cells and production of 
foreign polypeptides therein is disclosed by Guarino et 
al . , U.S. Patent No. 5,162,222; Bang et al . , U.S. Patent 
No. 4,775,624; and WIPO publication WO 94/06463. The use 
35 of Agrobacterlum rhizogenes as a vector for expressing 
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genes in plant cells has been reviewed by Sinkar et al . , 
cJ. Biosci . (Bangalore) 21:47-58, (1987). 

Fungal cells, including yeast cells, and 
5 particularly cells of the genus Saccharomyces , can also be 
used within the present invention, such as for producing 
protein fragments or polypeptide fusions. Methods for 
transforming yeast cells with exogenous DNA and producing 
recombinant polypeptides therefrom are disclosed by, for 

10 example, Kawasaki, U.S. Patent No. 4,599,311; Kawasaki et 
al., U.S. Patent No. 4,931,373; Brake, U.S. Patent No. 
4,870,008; Welch et al . , U.S. Patent No. 5,037,743; and 
Murray et al . , U.S. Patent No. 4,845,075. Transformed 
cells are selected by phenotype determined by the 

15 selectable marker, commonly drug resistance or the ability 
to grow in the absence of a particular nutrient (e.g., 
leucine) . A preferred vector system for use in yeast is 
the POT1 vector system disclosed by Kawasaki et al . (U.S. 
Patent No. 4,931,373), which allows transformed cells to 

20 be selected by growth in glucose-containing media. 
Suitable promoters and terminators for use in yeast 
include those from glycolytic enzyme genes (see, e.g., 
Kawasaki, U.S. Patent No. 4,599,311; Kingsman et al . , U.S. 
Patent No. 4,615,974; and Bitter, U.S. Patent No. 

25 4, 977,092 ) and alcohol dehydrogenase genes. See also U.S. 
Patents Nos. 4,990,446; 5,063,154; 5,139,936 and 
4,661,454. Transformation systems for other yeasts, 
including Hansenula polymorpha , Schl zosaccharomyces pombe, 
Kluyveromyces lactis, Kluyveromyces fragilis, Ustilago 

30 maydis, Pichia pastoris, Plchia methanolica, Pichia 

guillermondil and Candida maltosa are known in the art. 
See, for example, Gleeson et al . , J. Gen. Microbiol . 
132:3459-3465, (1986) and Cregg, U.S. Patent No. 
4,882,279. Aspergillus cells may be utilized according to 

35 the methods of McKnight et al . , U.S. Patent No. 4,935,349." 
Methods for transforming Acremonium chrysogenum are 
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disclosed by Sumino et al . , U.S. Patent No. 5,162,228. 
Methods for transforming Neurospora are disclosed by 
Lambowitz, U.S. Patent No. 4,486,533. 

5 Transformed or transfected host cells are 

cultured according to conventional procedures in a culture 
medium containing nutrients and other components required 
for the growth of the chosen host cells. A variety of 
suitable media, including defined media and complex media, 

10 are known in the art and generally include a carbon 

source, a nitrogen source, essential amino acids, vitamins 
and minerals. Media may also contain such components as 
growth factors or serum, as required. The growth medium 
will generally select for cells containing the exogenously 

15 added DNA by, for example, drug selection or deficiency in 
an essential nutrient which is "complemented by the 
selectable marker carried on the expression vector or co- 
transfected into the host cell. 

20 Within one aspect of the present invention, a 

novel protein is produced by a cultured cell, and the cell- 
is used to screen for a receptor or receptors for the 
protein, including the natural receptor, as well as 
agonists and antagonists of the natural ligand. 



25 



PROTEIN ISOLATION: 



Expressed recombinant polypeptides (or chimeric 
polypeptides) can be purified using fractionation and/or 

3 0 conventional purification methods and media. Ammonium 

sulfate precipitation and acid or chaotrope extraction may 
be used for fractionation of samples. Exemplary 
purification steps may include hydroxyapatite , size 
exclusion, FPLC and reverse -phase high performance liquid 

3 5 chromatography. Suitable anion exchange media include 

derivatized dextrans, agarose, cellulose, polyacrylamide , 
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specialty silicas, and the like. PEI, DEAE, QAE and Q 
derivatives are preferred, with DEAE Fast-Flow Sepharose 
(Pharmacia, Piscataway, NJ) being particularly preferred. 
Exemplary chromatographic media include those media 
derivatized with phenyl, butyl, or octyl groups, such as 
Phenyl -Sepharose FF (Pharmacia) , Toyopearl butyl 650 (Toso 
Haas, Mont gomeryvi lie, PA), Octyl -Sepharose (Pharmacia) 
and the like; or polyacrylic resins, such as Amberchrom CG 
71 (Toso Haas) and the like. Suitable solid supports 
include glass beads, silica-based resins, cellulosic 
resins, agarose beads, cross-linked agarose beads, 
polystyrene beads, cross -linked polyacrylamide resins and 
the like that are insoluble under the conditions in which 
they are to be used. These supports may be modified with 
reactive groups that allow attachment of proteins by amino 
groups, carboxyl groups, sulfhydryl groups, hydroxyl 
groups and/or carbohydrate moieties. Examples of coupling 
chemistries include cyanogen bromide activation, N- 
hydroxysuccinimide activation, epoxide activation, 
sulfhydryl activation, hydrazide activation, and carboxyl 
and amino derivatives for carbodiimide coupling 
chemistries. These and other solid media are well known 
and widely used in the art, and are available from 
commercial suppliers. Methods for binding receptor 
polypeptides to support media are well known in the art. 
Selection of a particular method is a matter of routine 
design and is determined in part by the properties of the 
chosen support. See, for example, Affinity 
Chromatography: Principles & Methods, Pharmacia LKB 
30 Biotechnology, Uppsala, Sweden, (1988) . 

The polypeptides of the present invention can be 
isolated by exploitation of their properties. For 
example, immobilized metal ion adsorption (IMAC) 
35 chromatography can be used to purify histidine-rich 

proteins. Briefly, a gel is first charged with divalent 
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metal ions to form a chelate, E. Sulkowski, Trends in 
Biochem. 3:1-7, (1985). .Hist idine-rich proteins will be 
adsorbed to this matrix with differing affinities, 
depending upon the metal ion used, and will be eluted by 
5 competitive elution, lowering the pH, or use of strong 
chelating agents. Other methods of purification include 
purification of glycosylated proteins by lectin affinity 
chromatography and ion exchange chromatography, Methods, in 
Enzymol . , Vol. 182, "Guide to Protein Purification", M. 
10 Deutscher, (ed.), Acad. Press, San Diego, (1990), pp. 529- 
39. Alternatively, a fusion of the polypeptide of 
interest and an affinity tag (e.g., polyhist idine , 

maltose-binding protein, an immunoglobulin domain) may be 
constructed to facilitate purification. 

15 

Physical Structure of Zneul 

The Zneul polypeptide shown in SEQ ID NO: 2 has 
a signal peptide including amino acid residues 1-19. Amino 

20 acid residues 20-104 define a hydrophilic domain 

homologous to an HSMHC3W5A domain, SEQ ID NO: 17, (GenBank 
No. gl401159) . Amino acid residues 105-135 define a domain 
homologous to an Epidermal Growth Factor (EGF) domain. 
Amino acid residues 136-177 define another domain 

2 5 homologous to an EGF domain; and amino acid residues 17 8- 
2 73 define a domain also homologous to an HSMHC3W5A 
domain . 

However, the first EGF-like domain (EGF1) of 
30 Zneul, SEQ ID NO: 9 which corresponds to amino acid 

residues 105 to 135 of SEQ ID NO: 2, is distinct from any 
other EGF domain in the prior art. The EGF1 in Zneul is 
about 56% similar to the HSMHC3W5A_6 domain, its closest 
human relative. 

35 
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The second EGF-like domain (EGF2) of Zneul, SEQ 
ID NO: 10 which corresponds to amino acid residues 136 to 
177 of SEQ ID NO: 2, is distinct from any other EGF domain 
in the prior art. EGF2 of Zneul is about 48% similar to" 
5 PIR_S31101 fibrillin, its closest human relative. 

The first HSMHC3W5A- like (HSM1) domain of Zneul, 
SEQ ID NO: 8 which corresponds to amino acid residues 20- 
104 of SEQ ID NO: 2. SEQ ID NO: 8 is approximately 38% 
10 similar to HSMHC3W5A, its closest human relative. 

The second HSMHC3W5A-like domain (HSM2) of 
Zneul, SEQ ID NO: 11 which corresponds to amino acid 
residues 178-273 of SEQ ID NO: 2, is distinct from any 
15 other polypeptide in the prior art. It is about 32% 
similar to HSMHC3W5A_6 . 

Uses 

20 The tissue specificity of Zneul expression 

indicates that Zneul can be used as a growth, maintenance, 
or differentiation factor in the spinal cord, heart, 
spleen, testis, thyroid and lymph nodes. 

25 The present invention also provides reagents 

which will find use in diagnostic applications. For 
example, the Zneul gene has been mapped on chromosome 
9q34.3. A Zneul nucleic acid probe could be used to check 
for abnormalities in chromosome 9. In a normal chromosome 

3 0 9, one would predict that a Zneul nucleic acid probe would 
hybridize to chromosome 9. If the probe does not hybridize 
to chromosome 9, this would indicate an abnormality in 
chromosome 9 . 

35 Zneul' s closest human homolog is HSMHC3W5A a 

gene in the HLA class III region, which is contained in a 
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cosmid which contains Notch 4. Zneul is also homologous to 
Notch 4 in its EGF-like domains. Zneul may be involved in 
EGF-receptor pathways. 

5 Notch Structure/Function 

The original member of this gene family was the 
Drosophila gene Notch which controls cell fate decisions 
in the development of the peripheral nervous system. 

10 Notch is a cell surface receptor with a single 

transmembrane domain. Homologues have now been found in C. 
elegans {linl2 and glpl) , Xenopus, mouse and human. All 
members of the Notch family have large numbers of EGF-like 
motifs (29-39 in mouse, 10-13 in C. elegrans) and three or 

15 more copies of LNR (linl2/ Notch repeats) in the 

extracellular domain. Notch family members also contain . 
six copies of the cdclO/SWI6 motif (also called ankyrin 
repeats) and a PEST protein degradation sequence in the 
intracellular domain. Specific EGF repeats (Drosophila 

20 repeats 11 and 12) are involved in ligand binding. LNR 
may be regulatory domains which bind ligand when high 
ligand concentrations exist and cause decreased activity 
of Notch. CdclO/SWI6 domains are involved in protein- 
protein interactions with components of the Notch- 

25 activated signal transduction pathway. 

Notch Biology 

Two different translocations led to formation of 
30 altered Notch genes resulting in an oncogenic state. The 
TAN- 1 oncogene is a fusion of part of the (5 T cell 
receptor with a small region of the human Notch 1 
extracellular domain and the entire intracellular domain. 
TAN-1 is an activated form of Notch which causes T- 
35 lymphoblastic leukemias. The int-3 oncogene is caused by" 
integration of the mouse mammary tumor virus into the 
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Notch 4 gene resulting in expression of the intact 
intracellular domain. Int-3 also is an activated form of 
Notch which leads to mammary carcinoma. 

The function of Notch family members has been 
extensively studied in Drosophila and C. elegans . These 
proteins control binary decisions that depend on cell-cell 
interactions. Notch proteins act consistent with their 
proposed role as a receptor. Gain-of -function and loss- 
of-function Notch alleles result in opposite cell fate 
decisions. Notch receptors and their ligands play 
important roles in lateral inhibition, the process whereby 
signaling between neighboring cells is amplified by a 
feedback loop between Notch and its ligand. This process 
results in increased receptor activity in some cells and 
increased ligand activity in others leading to the 
distinction between signaling cells and receiving cells. 

It has recently been shown that the expression 
of an activated form of Notchl in developing T cells of 
the mouse leads to both an increase in CD8 lineage T cells 
and a decrease in CD4 lineage T cells. Expression of 
activated Notch permits the development of mature CD 8 
lineage thymocytes even in the absence of class I major 
2 5 histocompatability complex (MHO proteins, ligands that 
are normally required for the development of these cells. 
However, activated Notch is not sufficient to promote CDS 
when both class I and class II MHC are absent. These 
results implicate Notch as a participant in the CD4 versus 
30 CD 8 lineage decision. Robey, E. et al . Cell 87: 483-492 
(1996) . 

Mutations in a gene region called CADASIL (for 
cerebral autosomal dominant arteriopathy with subcortical 
35 infarcts and leukoencephalopathy) on chromosome 19 are 
associated with a type of stroke and dementia whose key 
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features include recurrent subcortical ischaemic events 
and vascular dementia. Notch3 has been mapped to this 
region, and mutations in CAD AS I L patients indicate that 
Notch3 could be the defective protein in CADASIL patients, 
5 Joutel, A. et al . Mature 383:707-710 (1996). 

Notch Ligands 

There is also a conserved family of ligands for 
10 the Notch receptor family. Multiple ligands are able to 
activate the same receptor. For example, delta and 
serrate each act as ligands for Drosophila Notch. These 
ligands all contain EGF repeats (from 1-14), a DSL domain 
(delta, serrate, lag- 2) and a transmembrane domain. 
15 Therefore, receptor and ligand are homologous to one 
another. In addition, receptor and ligand are often 
coexpressed and are associated with each other in 
vesicles . 

20 Zneul Structure 

Zneul is similar to Notch and its ligands in 
having two EGF repeats. However, it has a small number of 
EGF repeats and lacks a membrane spanning domain, 

25 linl2/Notch domains and ankyrin repeats. Based on 

structure/function experiments of Notch, one would predict 
that Zneul would antagonize Notch function. If the EGF 
repeats in zneul could bind receptor, it could inhibit 
ligand binding on neighboring cells. Furthermore, Zneul 

30 may have its own target receptor for which it would be an 
agonist. 

Zneul Tissue Distribution/Multiple mRNA sizes 

35 Zneul is widely expressed in adult human 

tissues. Zneul is most highly expressed in heart, 
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placenta, spleen, testis, thyroid, spinal cord and lymph 
node. Dot blots indicate that Zneul is also expressed in 
a variety of fetal tissues. There are at least three mRNA 
sizes: 

5 

1.3 kb mRNA only in brain and testis 1.7 kb only 
in lymph node 

1.3 + 1.7 in multiple tissues 

2.4 kb only in placenta 

10 

Since the sequence of Zneul is from the 1.3 kb 
mRNA in brain, it is difficult to predict what types of 
molecules the larger transcripts encode. It is possible 
that larger forms could encode soluble Zneul proteins with 
15 more EGF repeats and other domains observed in Notch or 
Notch ligands. Alternatively, the extra sequences could 
encode transmembrane and intracellular domains. 

Possible relationship to Notch function 

20 

It is difficult to predict whether Zneul will 
act as a Notch ligand or to antagonize the activity of 
other Notch ligands by competing for receptor binding. 
Zneul may alter the binary decisions in differentiation of 
25 stem cells into specific lineages or may alter the cell 
fate decisions of adjacent cells. 

Alternatively, Zneul may have nothing to do with 
Notch. Many proteins have EGF repeats. Zneul may act as 
30 a growth factor for a different class of receptor. 

Other Possible Roles 

• role in breast cancer (EGF-receptor is 
3 5 overexpressed in many breast cancers) 
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• role in glioblastomas, pituitary 

adenomas . 

Mapping Data 

5 

Zneul maps to human chromosome 9q34.3, in the 
same chromosomal band as Notchl . It is of interest that 
Notch4 and HSNIHC3W5A are also linked at the MHC III locus, 
i.e., duplication of an authentic Notch receptor and a 2 
10 EGF- repeat novel protein. 



The rapeut ic utility 



Zneul and its antagonists can be used as 
therapeutic reagents for the following. 

1. Alzheimer's disease 

The Sell2 gene was identified as a suppresser of 
a linl2 gain-of -function mutant. Sell2 is a homolog of a 
positional cloned human early-onset familial Alzheimer's 
disease gene. Therefore, Zneul could affect a pathway 
affecting this disease and it is expressed in brain, 
albeit at lower levels than most other tissues. 

2 . Cancer 



There are a number of chromosomal rearrangements 
3 0 associated with breakpoints at 9q34 including Non- 

Hodgkin's lymphoma and acute myeloid leukemia. A probe for 
Zneul which does not properly hybridize to chromosome 9q34 
would indicate an abnormality of chromosome 9 and would 
indicate a possible predilection of the individual for 
3 5 developing cancer. 
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Given the possible association with Notch 4, an 
endothelial-specific gene, Zneul could be involved in' 
promoting or inhibiting endothelial cell tumors such as 
hemangiopericytomas? Another possibility is in 
5 angiogenesis since blocking a tumor's blood supply would 
be an effective cancer treatment . 

Given the tissues where Zneul is highly 
expressed, the most prevalent forms of cancer would be in 
10 the testis and lymph nodes. 

3 . Hematopoiesis 



15 
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35 



Moore et al (PNAS 94:4011-4016, 1997) implicated 
delta-like (a mammalian Notch ligand) in promoting both 
high-proliferative potential progenitors and in stem cell 
repopulation. Since Zneul is highly expressed in lymph 
node and spleen, it could either be involved in inhibiting 
differentiation to promote stem cell self -renewal or in 
determination of progenitor populations. Possible use in 
repopulating blood cells after chemotherapy treatment or 
in vitro expansion of stem cells. 

4 . Heart 

Stimulation of myofibroblast proliferation or 
migration in the repair process after myocardial 
infarction. Recently, a frizzled homolog has been 
implicated in this process. There is evidence for 
interactions between the frizzled and Notch pathways in 
Drosophila. 

5 . Placenta 

Stimulation or inhibition of various growth 
factor made in placenta. 
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6. Testis 

Role in fertility or contraception 

5 

7 . Spinal cord 

Zneul may play a role in Nerve regeneration 
since Notch plays a role in neurogenesis in both flies and 
10 mammalian cells. 



The present invention also provides reagents 
with significant therapeutic value. The Zneul polypeptide 

15 (naturally occurring or recombinant) , fragments thereof, 
antibodies and ant i-idiotypic antibodies thereto, along 
with compounds identified as having binding affinity to 
the Zneul polypeptide, should be useful in the treatment 
of conditions associated with abnormal physiology or 

20 development, including abnormal proliferation, e.g., 
cancerous conditions, or degenerative conditions. For 
example, a disease or disorder associated with abnormal 
expression or abnormal signaling by a Zneul polypeptide 
should be a likely target for an agonist or antagonist of 

2 5 the Zneul polypeptide. 

Antibodies to the Zneul polypeptide can be 
purified and then administered to a patient. These 
reagents can be combined for therapeutic use with 

30 additional active or inert ingredients, e.g., in 

pharmaceutical^ acceptable carriers or diluents along 
with physiologically innocuous stabilizers and excipients. 
These combinations can be sterile filtered and placed into 
dosage forms as by lyophilizat ion in dosage vials or 

35 storage in stabilized aqueous preparations. This invention 
also contemplates use of antibodies, binding fragments 
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thereof or single-chain antibodies of the antibodies 
including forms which are not complement binding. 

The quantities of reagents necessary for 
effective therapy will depend upon many different factors, 
including means of administration, target site, 
physiological state of the patient, and other medications 
administered. Thus, treatment dosages should be titrated 
to optimize safety and efficacy. Typically, dosages used 
in vitro may provide useful guidance in the amounts useful 
for in vivo administration of these reagents. Animal 
testing of effective doses for treatment of particular 
disorders will provide further predictive indication of 
human dosage. Methods for administration include oral, 
intravenous, peritoneal, intramuscular, or transdermal 
administration. Pharmaceut ically acceptable carriers will 
include water, saline, buffers to name just a few. Dosage 
ranges would ordinarily be expected from ljj.g to lOOOfig per 
kilogram of body weight per day. However, the doses by be 
higher or lower as can be determined by a medical doctor 
with ordinary skill in the art. For a complete discussion 
of drug formulations and dosage ranges see Remington' s 
Pharmaceutical Sciences , 17 th Ed., (Mack Publishing Co., 
Easton, Penn . , 1990), and Goodman and Oilman's: The 
Pharmacological Bases of Therapeutics , 9 th Ed. (Pergamon 
Press 1996) . 

Nucleic Acid-based Therapeutic Treatment 

If a mammal has a mutated or lacks a Zneul gene, 
the Zneul gene can be introduced into the cells of the 
mammal. In one embodiment, a gene encoding a Zneul 
polypeptide is introduced in vivo in a viral vector. Such 
vectors include an attenuated or defective DNA virus, such 
as but not limited to herpes simplex virus (HSV) , 
papillomavirus, Epstein Barr virus (EBV) , adenovirus, 
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adeno-associated virus (AAV), and the like. Defective 
viruses , which entirely or almost entirely lack viral 
genes, are preferred. A defective virus is not infective 
after introduction into a cell. Use of defective viral' 
5 vectors allows for administration to cells in a specific, 
localized area, without concern that the vector can infect 
other cells. Examples of particular vectors include, but 
are not limited to, a defective herpes virus 1 (HSV1) 
vector [Kaplitt et al . , Molec. Cell. Neurosci.,2 :320-330 

10 (1991)], an attenuated adenovirus vector, such as the 

vector described by Stratf ord-Perricaudet et al . , J. Clin. 
Invest., 90 :626-630 (1992), and a defective adeno- 
associated virus vector [Samulski et al . , J". Virol., 
61:3096-3101 (1987); Samulski et al . J. Virol., 63:3822- 

15 3828 (1989) ] . 

In another embodiment, the gene can be 
introduced in a retroviral vector, e.g., as described in 
Anderson et al . , U.S. Patent No. 5,399,346; Mann et al., 

20 Cell, 33:153 (1983); Temin et al . , U.S. Patent No. 
4,650,764; Temin et al . , U.S. Patent No. 4,980,289; 
Markowitz et al . , J. Virol., 62:1120 (1988); Temin et al . , 
U.S. Patent No. 5,124,263; International Patent 
Publication No. WO 95/07358, published March 16, 1995 by 

25 Dougherty et al . / and Blood, 82:845 (1993). 

Alternatively, the vector can be introduced by 
lipofection in vivo using liposomes. Synthetic cationic 
lipids can be used to prepare liposomes for in vivo 

30 transfection of a gene encoding a marker [Feigner et al . , 
Proc. Natl. Acad. Sci. USA, 84:7413-7417 (1987); see 
Mackey et al . , Proc . Natl. Acad. Sci. USA, 85:8027-8031 
(1988)]. The use of lipofection to introduce exogenous 
genes into specific organs in vivo has certain practical 

35 advantages. Molecular targeting of liposomes to specific " 
cells represents one area of benefit. It is clear that 
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directing transfection to particular cells represents one 
area of benefit. It is clear that directing transfection 
to particular cell types would be particularly- 
advantageous in a tissue with cellular heterogeneity, such 
5 as the pancreas, liver, kidney, and brain. Lipids may be 
chemically coupled to other molecules for the purpose of 
targeting. Targeted peptides, e.g., hormones or 
neurotransmitters, and proteins such as antibodies, or 
non-peptide molecules could be coupled to liposomes 
10 chemically. 

It is possible to remove the cells from the body 
and introduce the vector as a naked DNA plasmid and then 
re- implant the transformed cells into the body. Naked DNA 

15 vector for gene therapy can be introduced into the desired 
host cells by methods known in the art, e.g., 
transfection, electroporation, microinjection, 
transduction, cell fusion, DEAE dextran, calcium phosphate 
precipitation, use of a gene gun or use of a DNA vector 

20 transporter [see, e.g., Wu et al . , J. Biol. Chem. , 

267:963-967 (1992); Wu et al . , J. Biol. Chew., 263 ; 14621- 
14624 (1988) ] . 

ANTIBODIES 

25 

ZNEU1 polypeptides can also be used to prepare 
antibodies that specifically bind to Zneul epitopes, 
peptides or polypeptides. The Zneul polypeptide or a 
fragment thereof serves as an antigen (immunogen) to 

30 inoculate an animal and elicit an immune response. 

Suitable antigens would be the Zneul polypeptide encoded 
by SEQ ID NO: 2 or 3 or at least a contiguous 9 amino acid 
fragment thereof . Antibodies generated from this immune 
response can be isolated and purified as described herein. 

35 Methods for preparing and isolating polyclonal and 

monoclonal antibodies are well known in the art. See, for 
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example, Current Protocols in Immunology, Cooligan, et al . 
(eds.) , National Institutes of Health, (John Wiley and 
Sons, Inc., 1995); Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Second Edition (Cold Spring Harbor, NY, 
5 1989); and Hurrell, J. G. R. , Ed., Monoclonal Hybridoma 
Antibodies: Techniques and Applications (CRC Press, Inc., 
Boca Raton, FL, 1982). 

As would be evident to one of ordinary skill in 
10 the art, polyclonal antibodies can be generated from 
inoculating a variety of warm-blooded animals such as 
horses, cows, goats, sheep, dogs, chickens, rabbits, mice, 
and rats with a Zneul polypeptide or a fragment thereof. 
The immunogenicity of a Zneul polypeptide may be increased 
15 through the use of an adjuvant, such as alum (aluminum 

hydroxide) or Freund's complete or incomplete adjuvant. . . . 
Polypeptides useful for immunization also include fusion 
polypeptides, such as fusions of Zneul or a portion 
thereof with an immunoglobulin polypeptide or with maltose 
* 20. binding protein. The polypeptide immunogen may be a full- . 
length molecule or a portion thereof. If the polypeptide 
portion is "hapten-like" , such portion may be 
advantageously joined or linked to a macromolecular 
carrier (such as keyhole limpet hemocyanin (KLH) , bovine 
25 serum albumin (BSA) or tetanus toxoid) for immunization. 

As used herein, the term "antibodies" includes 
polyclonal antibodies, affinity-purified polyclonal 
antibodies, monoclonal antibodies, and antigen-binding 
30 fragments, such as F(ab')2 and Fab proteolytic fragments. 
Genetically engineered intact antibodies or fragments, 
such as chimeric antibodies, Fv fragments, single chain 
antibodies and the like, as well as synthetic antigen- 
binding peptides and polypeptides, are also included. 



NSDOCID: <WO 9857983A2_I_> 



WO 98/57983 



36 



PCT/US98/12763 



10 



15 



Non-human antibodies may be humanized by grafting non- 
human CDRs onto human framework and constant regions, or 
by incorporating the entire non-human variable domains 
(optionally "cloaking" them with a human-like surface by 
replacement of exposed residues, wherein the result is a 
"veneered" antibody) . In some instances, humanized 
antibodies may retain non-human residues within the human 
variable region framework domains to enhance proper 
binding characteristics. Through humanizing antibodies, 
biological half-life may be increased, and the potential 
for adverse immune reactions upon administration to humans 
is reduced. 

Alternative techniques for generating or 
selecting antibodies useful herein include in vitro 
exposure of lymphocytes to Zneul protein or peptide, and 
selection of antibody display libraries in phage or 
similar vectors (for instance, through use of immobilized 
or labeled Zneul protein or peptide) . Genes encoding 
20 polypeptides having potential Zneul polypeptide binding 
domains can be obtained by screening random peptide 
libraries displayed on phage (phage display) or on 
bacteria, such as E. coli. Nucleotide sequences encoding 
the polypeptides can be obtained in a number of ways, such 
25 as through random mutagenesis and random polynucleotide 

synthesis. These random peptide display libraries can be 
used to screen for peptides which interact with a known 
target which can be a protein or polypeptide, such as a 
ligand or receptor, a biological or synthetic 
3 0 macromolecule, or organic or inorganic substances. 

Techniques for creating and screening such random peptide 
display libraries are known in the art (Ladner et al . , us 
Patent NO. 5,223,409; Ladner et al . , US Patent NO. 
4,946,778; Ladner et al . , US Patent NO. 5,403,484 and 
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Ladner et al., US Patent NO. 5,571,698) and random peptide 
display libraries and kits for screening such libraries 
are available commercially, for instance from Clontech 
(Palo Alto, CA) , Invitrogen Inc. (San Diego, CA) , New 
5 England Biolabs, Inc. (Beverly, MA) and Pharmacia LKB 
Biotechnology Inc. (Piscataway, NJ) . Random peptide 
display libraries can be screened using the Zneul 
sequences disclosed herein to identify proteins which bind 
to Zneul. These "binding proteins" which interact with 

10 Zneul polypeptides can be used for tagging cells; for 

isolating homolog polypeptides by affinity purification; 
they can be directly or indirectly conjugated to drugs, 
toxins, radionuclides and the like. These binding 
proteins can also be used in analytical methods such as 

15 for screening expression libraries and neutralizing 

activity. The binding proteins can also be used for 
diagnostic assays for determining circulating levels of 
polypeptides; for detecting or quantitating soluble 
polypeptides as marker of underlying pathology or disease., 

20 These binding proteins can also act as Zneul "antagonists" 
to block Zneul binding and signal transduction in vitro 
and in vivo. These ant i- Zneul binding proteins would be 
useful for down regulating the effect of Zneul. 

25 Antibodies are determined to be specifically 

binding if: 1) they exhibit a threshold level of binding 
activity, and/or 2) they do not significantly cross-react 
with related polypeptide molecules. First, antibodies 
herein specifically bind if they bind to a Zneul 

30 polypeptide, peptide or epitope with a binding affinity 

(K a ) of 10 6 M _1 or greater, preferably 10 7 M* 1 or greater, 

p _ -i 

more preferably 10 M or greater, and most preferably 
Q — 1 

10 M or greater. The binding affinity of an antibody -■ 
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can be readily determined by one of ordinary skill in the 
art, for example, by Scatchard analysis. 

Second, antibodies are determined to 
5 specifically bind if they do not significantly cross-react 
with related polypeptides. Antibodies do not significantly 
cross-react with related polypeptide molecules, for 
example, if they detect Zneul but not known related 
polypeptides using a standard Western blot analysis 

10 (Ausubel et al., ibid.). Examples of known related 

polypeptides are orthologs, proteins from the same species 
that are members of a protein family (e.g. IL-16) , Zneul 
polypeptides, and non-human Zneul. Moreover, antibodies 
may be "screened against" known related polypeptides to 

15 isolate a population that specifically binds to the 

inventive polypeptides. For example, antibodies raised to 
Zneul are adsorbed to related polypeptides adhered to 
insoluble matrix; antibodies specific to Zneul will flow 
through the matrix under the proper buffer conditions. 

20 Such screening allows isolation of polyclonal and 

monoclonal antibodies non-crossreact ive to closely related 
polypeptides, Antibodies : A Laboratory Manual, Harlow and 
Lane (eds.) (Cold Spring Harbor Laboratory Press, 1988); 
Current Protocols in Immunology, Cooligan, et al. (eds.), 

25 National Institutes of Health (John Wiley and Sons, Inc., 
1995) . Screening and isolation of specific antibodies is 
well known in the art. See, Fundamental Immunology, Paul 
(eds.) (Raven Press, 1993); Getzoff et al., Adv. in 
Immunol. 43: 1-98 (1988); Monoclonal Antibodies: 

30 Principles and Practice, Coding, J.W. (eds.), (Academic 
Press Ltd., 1996); Benjamin et al., Ann. Rev. Immunol. 2: 
67-101 (1984) . 
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A variety of assays known to those skilled in 
the art can be utilized to detect antibodies which 
specifically bind to Zneul proteins or peptides. 
Exemplary assays are described in detail in Antibodies': A 
Laboratory Manual, Harlow and Lane (Eds.) (Cold Spring 
Harbor Laboratory Press, 1988). Representative examples 
of such assays include: concurrent Immunoelectrophoresis, 
radioimmunoassay, radioimmuno-precipitation, enzyme-linked 
immunosorbent assay (ELISA) , dot blot or Western blot 
assay, inhibition or competition assay, and sandwich 
assay. In addition, antibodies can be screened for 
binding to wild-type versus mutant Zneul protein or 
polypeptide . 



Antibodies to Zneul may be used for tagging 
cells that express Zneul; for isolating Zneul by affinity 
purification; for diagnostic assays for determining 
circulating levels of Zneul polypeptides; for detecting or 
quantitating soluble Zneul as marker of underlying 
20 pathology or disease; in analytical methods employing - 
FACS; for screening expression libraries; for generating v. 
anti-idiotypic antibodies; and as neutralizing antibodies 
or as antagonists to block Zneul in vitro and in vivo. 
Suitable direct tags or labels include radionuclides, 
enzymes, substrates, cofactors, inhibitors, fluorescent 
markers, chemiluminescent markers, magnetic particles and 
the like; indirect tags or labels may feature use of 
biotin-avidin or other complement /anti-complement pairs as 
intermediates. Antibodies herein may also be directly or 
indirectly conjugated to drugs, toxins, radionuclides and 
the like, and these conjugates used for in vivo diagnostic 
or therapeutic applications. Moreover, antibodies to 
Zneul or fragments thereof may be used in vitro to detect 
denatured Zneul or fragments thereof in assays, for 
example, Western Blots or other assays known in the art. ' 



35 
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An additional embodiment of the present 
invention relates to a peptide or polypeptide which has 
the amino acid sequence of an epitope -bearing portion of a 
Zneul polypeptide having an amino acid sequence described 
above. Peptides or polypeptides having the amino acid 
sequence of an epitope-bearing portion of a Zneul 
polypeptide of the present invention include portions of 
such polypeptides with at least nine, preferably at least 
15 and more preferably at least 3 0 to 50 amino acids, 
although epitope-bearing polypeptides of any length up to 
and including the entire amino acid sequence of a 
polypeptide of the present invention described above are 
also included in the present invention. Examples of said 
polypeptides are defined by the amino acid sequences of 
15 SEQ ID NOs: 20-23. Also claimed are any of these 

polypeptides that are fused to another polypeptide or 
carrier molecule. 



10 



20 



25 



The invention is further illustrated by the 
following non- limiting examples. 

Example 1. Cloning of Zneul 



Zneul was identified from expressed sequence tag 
(EST) SEQ ID NO: 4. The cDNA clone containing the EST was 
discovered in a brain cDNA library which contained the 
EST. The cDNA was isolated from E. coli transfected with 
30 the plasmid and then streaked out on an LB 100 jig/ml 
ampicillin and 100 ug/ml methicillin plate. The cDNA 
insert was sequenced. The insert was determined to be 1514 
base pairs long with a 274 amino acid open reading frame 
and a putative 19 amino acid signal peptide. 
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Example 2 
Northern Blot Analysis 

Human multiple tissue blots 1,2,3 (Clontech) were 
probed to determine the tissue distribution of Zneul. A 
5 Hindlll/ NotI fragment containing the entire Zneul coding 
region was generated from the isolated cDNA clone and used 
for the probe. A plasmid prep of the clone was prepared 
from a 5 ml LB 100 |ig/ml ampicillin overnight culture at 
37° using the QIAprep Spin Miniprep Kit (Qiagen) . 20 jil out 

10 of 100 nl were digested with 3 ^1 of NEB Buffer 3, 10 
units of Hindlll (Gibco BRL) and 10 units Notl (New 
England Biolabs) in a 30 jil reaction at 37°C for 2 hours. 
The digest was electrophoresed on a 0.8% TBE agarose gel 
and the fragment was cut out. The DNA was extracted from 

15 the gel slab with a QIAquick Gel Extraction Kit (Qiagen) . 

25 ng of this DNA was labeled with P 32 using the Multiprime 
DNA Labeling System (Amersham) and unincorporated 
radioactivity was removed with a NucTrap Probe 
Purification Column (Stratagene) . Multiple tissue 

20 northerns and a human RNA master blot were prehybridized 3 
hours with 10 ml ExpressHyb Solution and added to blots - 
Hybridization was carried out overnight at 42°C with a 10 
ml solution of probe containing a concentration of 2 x 
10 6 /ml of probe to which 1 mg of salmon sperm DNA was added 

25 which had been boiled for 5 minutes and then iced 1 minute 
and added to 10 ml of ExpressHyb Solution (Clontech) . 
Initial wash conditions were as follows: 2X SSC, 0.05% SDS 
RT for 40 minutes with several changes of solution then 
0.1X SSC, 0.1% SDS at 65°C for 40 minutes, 1 solution 

30 change. Blots were than exposed to film a -80°C. There was 
cross hybridization/background so blots were further 
washed at 72°C then 65°C with 0.1% X SSC, 0.1% SDS for 1 
hour each . ~ - 
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The results showed that Zneul is widely expressed in 
adult tissues. Zneul is highly expressed in heart, 
placenta, spleen, testis, thyroid, spinal cord and lymph 
node. There are at least three mRNA sizes: 

5 1 . 3 kb mRNA only in brain and testis; 

1.4 kb only in lymph node; 

1.5 + 1.7 kb in multiple tissues; and 
2.4 kb only in placenta. 



10 Example 3 

Chromosomal Assignment and Placement of Zneul. 

Zneul was mapped to chromosome 9 using the 
15 commercially available "GeneBridge 4 Radiation Hybrid 
Panel" (Research Genetics, Inc., Huntsville, AL) . The 
GeneBridge 4 Radiation Hybrid Panel contains PCRable DNAs 
from each of 93 radiation hybrid clones, plus two control 
DNAs (the HFL donor and the A23 recipient). A publicly 
20 available WWW server (http://www-genome.wi.mit.edu/cgi- 
bin/cont ig/rhmapper . pi ) allows mapping relative to the 
Whitehead Institute/MIT Center for Genome Research's 
radiation hybrid map of the human genome (the "WICGR" 
radiation hybrid map) which was constructed with the 
25 GeneBridge 4 Radiation Hybrid Panel. 

For the mapping of Zneul with the "GeneBridge 4 RH 
Panel", 20 pi reactions were set up in a PCRable 96-well 
microtiter plate (Stratagene, La Jolla, CA) and used in a 
30 "RoboCycler Gradient 96" thermal cycler (Stratagene) . Each 
of the 95 PCR reactions consisted of 2 pi 10X KlenTaq PCR" 
reaction buffer (CLONTECH Laboratories, Inc., Palo Alto, 
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CA), 1.6 yl dNTPs mix (2.5 mM each, PERKIN- ELMER, Foster 
City, CA) , 1 pi sense primer, SEQ ID NO: 6, 1 yl antisense 
primer, SEQ ID NO: 7, 2 pi "RediLoad" (Research Genetics, 
Inc., Huntsville, AL) , 0.4 pi SOX Advantage KlenTaq 
5 Polymerase Mix (Clontech Laboratories, Inc.), 25 ng of DNA 
from an individual hybrid clone or control and x pi ddH2C 
for a total volume of 20 pi. The reactions were overlaid 
with an equal amount of mineral oil and sealed. The PCR 
cycler conditions were as follows: an initial 1 cycle 5 

10 minute denaturation at 95°C, 35 cycles of a 1 minute 

denaturation at 95°C, 1 minute annealing at 70°C and 1.5 
minute extension at 72°C, followed by a final 1 cycle 
extension of 7 minutes at 72°C. The reactions were 
separated by electrophoresis on a 2% agarose gel (Life 

15 Technologies, Gaithersburg , MD) . 

The results showed that Zneul maps 529.80 cR_3000 
from the top of the human chromosome 9 linkage group on 
the WICGR radiation hybrid map, 7.90 cR_3000 distal of 
20 framework marker D9S158. This positions Zneul in the 

9q34.3 region on the integrated LDB chromosome 9 map (The : 
Genetic Location Database, University of Southhampton, WWW- 
server : http : //cedar . genetics . sot on . ac . uk/public_html/ ) . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: ZymoGenetics . Inc. 

1201 Eastlake Ave East 

Seattle 

WA 

USA 

98102 

(ii) TITLE OF THE INVENTION: MAMMALIAN NEURO-GROWTH FACTOR LIKE 
PROTEIN 



(iii) NUMBER OF SEQUENCES: 24 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Zymogenetics 

(B) STREET: 1201 Eastlake Ave East 

(C) CITY: Seattle 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 98102 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(vi ii ) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lunn. Paul G 

(B) REGISTRATION NUMBER: 32,743 
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(C) REFERENCE/DOCKET NUMBER: 97-28PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-442-6627 

(B) TELEFAX: 206-442-6678 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 69. . .887 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AAGCTTGGCA CGAGGTGGCA CGAGGCCTCG TGCCAAGCTT GGCACGAGGC CGCCTGGAGG 60 
CACAGGCC ATG AGG GGC TCT CAG GAG GTG CTG CTG ATG TGG CTT CTG GTG 110 
Met Arg Gly Ser Gin Glu Val Leu Leu Met Trp Leu Leu Val 
1 5 10 

TTG GCA GTG GGC GGC ACA GAG CAC GCC TAC CGG CCC GGC CGT AGG GTG 158 
Leu Ala Val Gly Gly Thr Glu His Ala Tyr Arg Pro Gly Arg Arg Val 
15 20 25 30 

TGT GCT GTC CGG GCT CAC GGG GAT CCT GTC TCC GAG TCG TTC GTG CAG 206 
Cys Ala Val Arg Ala His Gly Asp Pro Val Ser Glu Ser Phe Val Gin 
35 40 45 

CGT GTG TAC CAG CCC TTC CTC ACC ACC TGC GAC GGG CAC CGG GCC TGC 254 
Arg Val Tyr Gin Pro Phe Leu Thr Thr Cys Asp Gly His Arg Ala Cys 
50 55 60 

AGC ACC TAC CGA ACC ATC TAT AGG ACC GCC TAC CGC CGC AGC CCT GGG 302 
Ser Thr Tyr Arg .Thr He Tyr Arg Thr Ala Tyr Arg Arg Ser Pro Gly 
65 70 75 
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CTG GCC CCT GCC AGG CCT CGC TAC GCG TGC TGC CCC GGC TGG AAG AGG 350 
Leu Ala Pro Ala Arg Pro Arg Tyr Ala Cys Cys Pro Gly Trp Lys Arg 
80 85 90 

ACC AGC GGG CTT CCT GGG GCC TGT GGA GCA GCA ATA TGC CAG CCG CCA 398 
Thr Ser Gly Leu Pro Gly Ala Cys Gly Ala Ala He Cys Gin Pro Pro 
95 100 105 110 

TGC CGG AAC GGA GGG AGC TGT GTC CAG CCT GGC CGC TGC CGC TGC CCT 446 
Cys Arg Asn Gly Gly Ser Cys Val Gin Pro Gly Arg Cys Arg Cys Pro 
115 120 125 

GCA GGA TGG CGG GGT GAC ACT TGC CAG TCA GAT GTG GAT GAA TGC AGT 494 
Ala Gly Trp Arg Gly Asp Thr Cys Gin Ser Asp Val Asp Glu Cys Ser 
130 135 140 

GCT AGG AGG GGC GGC TGT CCC CAG CGC TGC GTC AAC ACC GCC GGC AGT 542 
Ala Arg Arg Gly Gly Cys Pro Gin Arg Cys Val Asn Thr Ala Gly Ser 
145 150 155 

TAC TGG TGC CAG TGT TGG GAG GGG CAC AGC CTG TCT GCA GAC GGT ACA 590 
Tyr Trp Cys Gin Cys Trp Glu Gly His Ser Leu Ser Ala Asp Gly Thr 
160 165 170 

CTC TGT GTG CCC AAG GGA GGG CCC CCC AGG GTG GCC CCC AAC CCG ACA 638 
Leu Cys Val Pro Lys Gly Gly Pro Pro Arg Val Ala Pro Asn Pro Thr 
175 180 185 190 

GGA GTG GAC AGT GCA ATG AAG GAA GAA GTG CAG AGG CTG CAG TCC AGG 686 
Gly Val Asp Ser Ala Met Lys Glu Glu Val Gin Arg Leu Gin Ser Arg 
195 200 205 

GTG GAC CTG CTG GAG GAG AAG CTG CAG CTG GTG CTG GCC CCA CTG CAC 734 
Val Asp Leu Leu Glu Glu Lys Leu Gin Leu Val Leu Ala Pro Leu His 
210 215 220 

AGC CTG GCC TCG CAG GCA CTG GAG CAT GGG CTC CCG GAC CCC GGC AGC 782 
Ser Leu Ala Ser Gin Ala Leu Glu His Gly Leu Pro Asp Pro Gly Ser 
225 230 235 

CTC CTG GTG CAC TCC TTC CAG CAG CTC GGC CGC ATC GAC TCC CTG AGC 830 
Leu Leu Val His Ser Phe Gin Gin Leu Gly Arg He Asp Ser Leu Ser 
240 245 250 
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GAG CAG ATT TCC TTC CTG GAG GAG CAG CTG GGG TCC TGC TCC TGC AAG 878 
Glu Gin He Ser Phe Leu Glu Glu Gin Leu Gly Ser Cys Ser Cys Lys 
255 260 265 270 

AAA GAC TCG TGACTGCCCA GCGCCCCAGG CTGGACTGAG CCCCTCACGC CGCCCTGCA 936 
Lys Asp Ser 



GCCCCCATGC CCCTGCCCAA CATGCTGGGG GTCCAGAAGC CACCTCGGGG TGACTGAGCG 996 

GAAGGCCAGG CAGGGCCTTC CTCCTCTTCC TCCTCCCCTT CCTCAGGAGG CTCCCCAGAC 1056 

CCTGGCATGG GATGGGCTGG GATCTTCTCT GTGAATCCAC CCCTGGCTAC CCCCACCCTG 1116 

GCTACCCCAA CGGCATCCCA AGGCCAGGTG GGCCCTCAGC TGAGGGAAGG TACGAGCTCC 1176 

CTGCTGGAGC CTGGGACCCA TGGCACAGGC CAGGCAGCCC GGAGGCTGGG TGGGGCCTCA 1236 

GTGGGGGCTG CTGCCTGACC CCCAGCACAA TAAAAATGAA ACGTGAAAAA AAAAAAAAAA 1296 

A 1297 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met 


Arg 


Gly 


Ser 


Gin 


Glu 


Val 


Leu 


Leu 


Met 


Trp 


Leu Leu Val Leu Ala 


1 








5 










10 




15 


Val 


Gly 


Gly 


Thr 


Glu 


His 


Ala 


Tyr 


Arg 


Pro 


Gly 


Arg Arg Val Cys Ala 








20 










25 






30 


Val 


Arg 


Ala 


His 


Gly 


Asp 


Pro 


Val 


Ser 


Glu 


Ser 


Phe Val Gin Arg Val 






35 










40 








45 


Tyr 


Gin 


Pro 


Phe 


Leu 


Thr 


Thr 


Cys 


Asp 


Gly 


His 


Arg Ala Cys Ser Thr 




50 










55 










60 


Tyr 


Arg 


Thr 


He 


Tyr 


Arg 


Thr 


Ala 


Tyr 


Arg 


Arg 


Ser Pro Gly Leu Ala 


65 










70 










75 


80 


Pro 


Ala 


Arg 


Pro 


Arg 


Tyr 


Ala 


Cys 


Cys 


Pro 


Gly 


Trp Lys Arg Thr Ser 










85 










90 




95 


Gly 


Leu 


Pro 


Gly 


Ala 


Cys 


Gly 


Ala 


Ala 


He 


Cys 


Gin Pro Pro Cys Arg 








100 










105 






110 


Asn 


Gly 


Gly 


Ser 


Cys 


Val 


Gin 


Pro 


Gly 


Arg 


Cys 


Arg Cys Pro Ala Gly 



115 120 125 
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1 • P 


MI y 


V3 1 y Mb p 


i nr Lyb b i n oer 




1 ?0 






Apn 

Mr y 


u ly 


b ly byb 


ri u u i n Mr y byb 








1 jU 


byb 


u III 


P v/c Trrv 
byb 1 I p 


Pin P1\/ Mi c Qor 

b i u b iy nib oer 








lOD 


vd l 


rr U 


i \/c pi w 
i_y s b i y 


Pi \ / DpA Di^A Ann 

biy ri U ri U Mry 










Ac n 
Mbp 




Al a Mot 

m i a net 


1 \/c PI 1 1 PI 1 1 \/a 1 
LyS b 1 U b 1 U val 










Lgu 


Lgu 


PI 1 1 Plii 
b 1 U b 1 U 


Lys Leu bin Leu 




210 




215 


Ala 


Ser 


Gin Ala 


Leu Glu His Gly 


225 






230 


Val 


His 


Ser Phe 


Gin Gin Leu Gly 








245 


He 


Ser 


Phe Leu 


Glu Glu Gin Leu 






260 




Ser 









Asp Val Asp Glu Cys Ser Ala Arg 
140 

Val Asn Thr Ala Gly Ser Tyr Trp 
155 160 
Leu Ser Ala Asp Gly Thr Leu Cys 

170 175 
Val Ala Pro Asn Pro Thr Gly Val 
185 190 
Gin Arg Leu Gin Ser Arg Val Asp 
205 

Val Leu Ala Pro Leu His Ser Leu 

220 

Leu Pro Asp Pro Gly Ser Leu Leu 
235 240 
Arg He Asp Ser Leu Ser Glu Gin 

250 255 
Gly Ser Cys Ser Cys Lys Lys Asp 
265 270 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Thr Glu His Ala Tyr Arg Pro Gly Arg Arg Val Cys Ala Val Arg Ala 

15 10 15 

His Gly Asp Pro Val Ser Glu Ser Phe Val Gin Arg Val Tyr Gin Pro 

20 25 30 

Phe Leu Thr Thr Cys Asp Gly His Arg Ala Cys Ser Thr Tyr Arg Thr 

35 40 45 

lie Tyr Arg Thr Ala Tyr Arg Arg Ser Pro Gly Leu Ala Pro Ala Arg 

50 55 60 

Pro Arg Tyr Ala Cys Cys Pro Gly Trp Lys Arg Thr Ser Gly Leu Pro 
65 70 75 80 

Gly Ala Cys Gly Ala Ala He Cys Gin Pro Pro Cys Arg Asn Gly Gly 
85 90 95 



vJSDOCID: <WO 9857983A2_I_> 



WO 98/57983 



PCT/US98/ 12763 



49 



Ser 


Cys VaT Gin 


Pro 


Gly Arg Cys 


Arg Cys Pro Ala 


Gly 


Trp 


Ara 


Gl v 




100 






105 




110 


Asp 


Thr Cys Gin 


Ser 


Asp Val Asp 


Glu Cys Ser Ala 


Arq 


Arq 


Gly 


Gl v 




115 




120 




125 




Cys 


Pro Gin Arg 


Cys 


Val Asn Thr 


Ala Gly Ser Tyr 


Tro 


Cvs 


Gin 


Cvs 




130 




135 


140 






Trp 


Glu Gly His 


Ser 


Leu Ser Ala 


Asp Gly Thr Leu 


Cvs 


Val 


Pro 




145 






150 


155 






160 


Gly 


Gly Pro Pro 


Arg 


Val Ala Pro 


Asn Pro Thr Gly 


Val 


Asp 


Spr 

o cr t 


Al ?i 






165 




170 




175 




Met 


Lys Glu Glu 


Val 


Gin Arg Leu 


Gin Ser Arg Val 


Asp 


Leu 


Leu 


Glu 


Glu 


180 






185 


190 






Lys Leu Gin 


Leu 


Val Leu Ala 


Pro Leu His Ser 


i_6U 


Ala 


JCI 


Gl n 

VJ 1 f f 


Ala 


195 




200 




205 






Leu Glu His 


Gly 


Leu Pro Asp 


Pro Gly Ser Leu 


Leu 


Val 


His 


Ser 




210 




215 


220 










Phe 


Gin Gin Leu 


Gly 


Arg He Asp 


Ser Leu Ser Glu 


Gin 


He 


Ser 


Phe 


225 






230 


235 








240 


Leu 


Glu Glu Gin 


Leu 


Gly Ser Cys 


Ser Cys Lys Lys 


Asp 


Ser 








245 




250 









(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

GGCGGCGGCG CGTGCGCGCC CCGGATCCGG CGGCCACCCA GAGGAGAAGG CCACCCCGCC 60 

TGGAGGCACA GGCCATGAGG GGCTCTCAGG AGGTGCTGCT GATGTGGCTT CTGGTGTTGG 120 

CAGTGGGCGG CACAGAGCAC GCCTACCGGC CCGGCCGTAG GGTGTGTGCT GTCCGGGCTC 180 

ACGGGGACCC TGTCTCCGAG TCGTTCGTGC AGCGTGTGTA CCAGCCCTTC CTCACCACCT 240 

GCGACGGGCA CCGGGCCTGC AGCACCTACC GAACCATCTA TAGG 284 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
TGCGGCGGTA GGCGGTCCTA TAGATGGTTC GGTAGGTGCT 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCTGATGTGG CTTCTGGT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 
(iv) ANTISENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGTAGGCGTG CTCTGTGC 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 



Thr 
1 1 1 1 


Hi «; 

1 1 1 J 


Arn 




i pi i 


Hi <; 

1 II o 
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1 1 c 


JCI 
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M 1 O 


1 pi i Al a 

LCU M 1 O 


Thr Tvr 
1 1 1 1 i y i 


Arn Al ^ Arn 
mi y Mid mi y 


1 








J 










1 0 




1 S 
j. j 


Gl V 

u iy 


Pro 


Arn 
mi y 


U 1 y 


1 PI 1 
LCU 


Tvr 

• y» 


Ala 

M 1 d 


Arn 
mi y 


Gl v 
u iy 


Al a Arn 
m i d mi y 


Glv Val 

O 1 jr V d 1 


Al a 1 pi i C v/c 
M I d LcU Uy o 


















J 






ou 


Tvr 

iyr 


Oct 


Al a 
M I d 


1 Pr i 


Ala 
M I d 


Val 
v d i 


Ala 
m i d 


1 pi i 
LcU 


Ala 
m i d 


Arn ftlv 
ni y b iy 


Al 3 1 oi i 
M 1 d LcU 


Alp* Hie T1p 
Mid n i o lie 






















4^ 




JCI 


p.i v 

b iy 


1 Ql 1 

LcU 


T w p 

iyr 


AT a 
M 1 d 


Car 

JCI 


Prn 
ri U 


Prn 


A rn 
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Vd 1 Mid 
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LcU OCI 


CI i i A m CI \/ 

b i u Hry b i y 
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b 1 U 


A rn 
Mi y 
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DO 
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1 UJ 






1 1 0 

llu 


T1p 

1 1 c 
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M 1 d 


Mry 
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M 1 d 


LcU 
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M I d 
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1 ?R 

1 <L J 
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Mr y 


Thr 
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<y r 


A rn 
Mr y 
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M I d 
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M 1 u 
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Gl V 
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Hi c; 


Arn 
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m i a 
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m i d 


Thr Tvr 
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Arn Al a 
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Arn filv A1a 
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i 

1JU 








1 
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.LOU 


Am 


Gl V 
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Arn 
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1 pi i 
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O 1 U Mid 


1 pi i Al a Prn 

LCU Mid n u 
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170 




± / j 


Arn 
mi y 


Ala 


1 PI 1 

LCU 


Ala 
m I a 


Ala 

m 1 a 


Arn 


Gl V 
vj i y 


Prn 
r i \j 


Arn 


Al f\ Arn 
r\ i a r\ i y 


Glv Thr 

VJ 1 jr 1 I 1 1 


Tvr Arn Al p» 

1 Jr 1 Ml y M l G 








180 










185 






190 


1 pii 


Ala 

l \ 1 CJ 


\*y o 


Tvr 


Spr 

JCI 




Tvr 


OCI 


Prn 


Arn Gl v 
mi y vj i y 


1 pi i Tvr 

lcu i y \ 


Thr Arn Prn 

(ill r\\ y r l KJ 






195 










200 






205 




Leu 


Tvr 
i j i 


Spr 


Ala 


Arn 


Gl v 


Thr 


His 


Arn 

r\i y 


Ser Glu 


Ara Glv 

ni y li i y 


1 pij Tvr 1 Ptl 

LCU 1 Jr 1 LCU 




210 










215 








220 




Glu 

vJ 1 U 


Pro 

r i vj 


Arn 
mi y 


Gl V 

u i y 


1 PI 1 
LCU 


Tvr 


Ala 

m i a 


1 pi i 

LCU 


Ala 

m i a 


fvc Tvr 

L- y 1 Jr 1 


oc i li i y 


1 pi i Tvr Al ^ 

LCU ' y ' M I G 


225 










230 








235 




240 


Leu 


Ala 


Ala 


Leu 


Ala 


He 


Leu 


Glu 


Cys 


Tyr Ser 


Gly Leu. 


Asn Pro Arg 










245 










250 




255 


Pro 


Arg 


Cys 


Tyr 


Ser 


Ala 


Arg 


Gly 


Ala 


Ser Asn 


Gly Leu 


Tyr Gly Leu 








260 










265 






270 


Tyr 


Ser 


Glu 


Arg 


Cys 


Tyr 


Ser 


Val 


Ala 


Leu Gly 


Leu Asn 


Pro Arg Gly 






275 










280 






285 




Leu 


Tyr 


Ala 


Arg 


Gly 


Cys 


Tyr 


Ser 


Ala 


Arg Gly 


Cys Tyr 


Ser Pro Arg 




290 










295 








300 




Ala 


Leu 


Ala 


Gly 


Leu 


Tyr 


Thr 


Arg 


Pro 


Ala Arg 


Gly Gly 


Leu Tyr Ala 


305 










310 








315 




320 
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Ser Pro 

Pro Val 

Leu Ala 

Ser Pro 
370 
Ser Asn 
385 

Arg Thr 

Gly Leu 

Ala Leu 

Ser Val 
450 
Arg Pro 
465 

Asn Pro 

Glu Arg 

Ala Leu 

Ala Arg 
530 
Leu Leu 
545 

Ala Leu 

Leu Ala 

He Ser 

Tyr Ser 
610 
Arg Pro 
625 

Arg Gly 
Gly Leu 



Thr His Arg 
325 

Ala Leu Ala 

340 
Ala Arg Gly 
355 

Arg Gly Leu 

Thr His Arg 

Arg Pro Cys 
405 

Gly Leu Tyr 

420 
Ala Ala Ser 
435 

Ala Leu Pro 

Arg Ala Arg 

Arg Thr His 
485 

Ala Leu Ala 

500 
Gly Leu Asn 
515 

Gly Val Ala 

Tyr Ser Leu 

Ala Pro Arg 
565 

Ser Glu Arg 

580 
Gly Leu Tyr 
595 

Glu Arg Leu 

His Glu Gly 

He Leu Glu 
645 

Gly Leu Asn 
660 



Cys Tyr Ser Gly 
Ser Pro 
Ala Arg 



Asn Ala 
375 
Ala Leu 
390 

Tyr Ser 

His He 

Pro Gly 

Arg Leu 
455 
Gly Val 
470 

Arg Gly 

Met Glu 

Ala Arg 

Leu Ala 
535 
Glu Gly 
550 

Leu Glu 

Gly Leu 

Leu Glu 

Glu Leu 
615 
Leu Asn 
630 

Ala Ser 
lie Leu 



Gly Leu 
345 
Gly Gly 
360 

Arg Gly 



Ala Gly 

Gly Leu 

Ser Ser 
425 
Leu Tyr 
440 

Tyr Ser 

Ala Leu 

Leu Tyr 

Thr Leu 
505 
Gly Leu 
520 

Ser Pro 

Leu Asn 

His He 

Asn Ala 
585 
Pro Arg 
600 

Glu Val 

Gly Leu 

Pro Ser 

Glu Ser 
665 



Leu Asn 
330 

Cys Tyr 

Leu Tyr 

Cys Tyr 

Leu Tyr 
395 
Asn Cys 
410 

Glu Arg 

Thr His 

Gly Leu 

Ala Leu 
475 
Val Ala 
490 

Tyr Ser 

Glu Gly 

Leu Glu 

Leu Glu 
555 
Ser Ser 
570 

Leu Ala 

Ala Ser 

Ala Leu 

Asn Leu 
635 
Glu Arg 
650 

Glu Arg 



Ser Glu Arg 

Ser Ser Glu 
350 

Gly Leu Tyr 

365 
Ser Val Ala 
380 

Ser Glu Arg 

Tyr Ser Thr 

Leu Glu Ser 
430 

Arg Leu Glu 

445 
Tyr Gly Leu 
460 

Ala Pro Arg 

Leu Ala Ser 

Gly Leu Gly 
510 

Leu Asn Ser 

525 
Leu Glu Gly 
540 

Val Ala Leu 

Glu Arg Leu 

Leu Glu Gly 
590 

Pro Pro Arg 

605 
His He Ser 
620 

Glu Gly Leu 

Leu Glu Ser 

Pro His Glu 
670 



Ala Ser 
335 

Arg Ala 

Cys Tyr 

Leu Ala 

Thr Tyr 
400 
Arg Pro 
415 

Glu Arg 

Cys Tyr 

Tyr Pro 

Ala Ser 
480 
Pro Ser 
495 

Leu Val 

Glu Arg 

Leu Gly 

Leu Glu 
560 
Glu Ala 
575 

Leu His 

Gly Leu 

Ser Glu 

Tyr Ala 
640 
Glu Arg 
655 

Leu Glu 
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Gly Leu Gly Leu Gly Leu Asn Leu Glu Gly Leu Tyr Ser Glu Arg Cys 

675 680 685 

Tyr Ser Ser Glu Arg Cys Tyr Ser Leu Tyr Ser Leu Tyr Ser Ala Ser 

. 690 695 700 

Pro Ser Glu Arg 
705 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Ala He Cys Gin Pro Pro Cys Arg Asn Gly Gly Ser Cys Val Gin Pro 

15 10 15 

Gly Arg Cys Arg Cys Pro Ala Gly Trp Arg Gly Asp Thr Cys Gin 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Asp Val Asp Glu Cys Ser Ala Arg Arg Gly Gly Cys Pro Gin Arg 

1 5 10 15 

Cys Val Asn Thr Ala Gly Ser Tyr Trp Cys Gin Cys Trp Glu Gly His 

20 25 30 

Ser Leu Ser Ala Asp Gly Thr Leu Cys Val 
35 40 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 256 amino acids 
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(B) TYPE: amino acid 
•(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Pro Arg Leu 


Tyr 


Ser 


Gly 


Leu 


Tyr 


Gly 


Leu Tyr Pro Arg Pro 


Arg Ala 


1 




5 










10 


15 


Arg Gly Val 


Ala 


Leu 


Ala 


Leu 


Ala 


Pro 


Arg Ala Ser Asn Pro 


Arg Thr 




20 










25 


30 




His Arg Gly 


Leu 


Tyr 


Val 


Ala 


Leu 


Ala 


Ser Pro Ser Glu Arg 


Ala Leu 


35 










40 




45 




Ala Met Glu 


Thr 


Leu 


Tyr 


Ser 


Gly 


Leu 


Gly Leu Val Ala Leu 


Gly Leu 


50 








55 






60 




Asn Ala Arg 


Gly 


Leu 


Glu 


Gly 


Leu 


Asn 


Ser Glu Arg Ala Arg 


Gly Val 


65 






70 








75 


80 


Ala Leu Ala 


Ser 


Pro 


Leu 


Glu 


Leu 


Glu 


Gly Leu Gly Leu Leu 


Tyr Ser 






85 










90 


95 


Leu Glu Gly 


Leu 


Asn 


Leu 


Glu 


Val 


Ala 


Leu Leu Glu Ala Leu 


Ala Pro 




100 










105 


110 




Arg Leu Glu 


1 1 • _ 

His 


lie 


Ser 


Ser 


Glu 


Arg 


Leu Glu Ala Leu Ala 


Ser Glu 


TIC 

lib 










i on 
12U 




125 




Arg G ly Leu 


Asn 


Ala 


Leu 


Ala 


Leu 


Glu 


Gly Leu His lie Ser 


Gly Leu 


130 








135 






140 




Tyr Leu Glu 


Pro 


Arg 


Ala 


Ser 


Pro 


Pro 


Arg Gly Leu Tyr Ser 


Glu Arg 


145 






150 








155 


160 


Leu Glu Leu 


Glu 


Val 


Ala 


Leu 


His 


He 


Ser Ser Glu Arg Pro 


His Glu 






165 










170 


175 


Gly Leu Asn 


Gly 


Leu 


Asn 


Leu 


Glu 


Gly 


Leu Tyr Ala Arg Gly 


He Leu 




180 










185 


190 




Glu Ala Ser 


Pro 


Ser 


Glu 


Arg 


Leu 


Glu 


Ser Glu Arg Gly Leu 


Gly Leu 


195 










200 




205 




Asn He Leu 


Glu 


Ser 


Glu 


Arg 


Pro 


His 


Glu Leu Glu Gly Leu 


Gly Leu 


210 








215 






220 




Gly Leu Asn 


Leu 


Glu 


Gly 


Leu 


Tyr 


Ser 


Glu Arg Cys Tyr Ser 


Ser Glu 


225 






230 








235 


240 


Arg Cys Tyr 


Ser 


Leu 


Tyr 


Ser 


Leu 


Tyr 


Ser Ala Ser Pro Ser 


Glu Arg 






245 










250 


255 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 331 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Thr His Arg Gly Leu His He Ser Ala Leu Ala Thr Tyr Arg Ala Arg 

1 5 10 15 

Gly Pro Arg Gly Leu Tyr Ala Arg Gly Ala Arg Gly Val Ala Leu Cys 

20 25 30 

Tyr Ser Ala Leu Ala Val Ala Leu Ala Arg Gly Ala Leu Ala His He 

35 40 45 

Ser Gly Leu Tyr Ala Ser Pro Pro Arg Val Ala Leu Ser Glu Arg Gly 

50 55 60 

Leu Ser Glu Arg Pro His Glu Val Ala Leu Gly Leu Asn Ala Arg Gly 
65 70 75 80 

Val Ala Leu Thr Tyr Arg Gly Leu Asn Pro Arg Pro His Glu Leu Glu 

85 90 95 

Thr His Arg Thr His Arg Cys Tyr Ser Ala Ser Pro Gly Leu Tyr His 

100 105 110 

He Ser Ala Arg Gly Ala Leu Ala Cys Tyr Ser Ser Glu Arg Thr His 

115 120 125 

Arg Thr Tyr Arg Ala Arg Gly Thr His Arg He Leu Glu Thr Tyr Arg 

130 135 140 

Ala Arg Gly Thr His Arg Ala Leu Ala Thr Tyr Arg Ala Arg Gly Ala 
145 150 155 160 

Arg Gly Ser Glu Arg Pro Arg Gly Leu Tyr Leu Glu Ala Leu Ala Pro 

165 170 175 

Arg Ala Leu Ala Ala Arg Gly Pro Arg Ala Arg Gly Thr Tyr Arg Ala 

180 185 190 

Leu Ala Cys Tyr Ser Cys Tyr Ser Pro Arg Gly Leu Tyr Thr Arg Pro 

195 200 205 

Leu Tyr Ser Ala Arg Gly Thr His Arg Ser Glu Arg Gly Leu Tyr Leu 

210 215 220 

Glu Pro Arg Gly Leu Tyr Ala Leu Ala Cys Tyr Ser Gly Leu Tyr Ala 
225 230 235 240 

Leu Ala Ala Leu Ala He Leu Glu Cys Tyr Ser Gly Leu Asn Pro Arg 

245 250 255 

Pro Arg Cys Tyr Ser Ala Arg Gly Ala Ser Asn Gly Leu Tyr Gly Leu 

260 265 270 

Tyr Ser Glu Arg Cys Tyr Ser Val Ala Leu Gly Leu Asn Pro Arg Gly 
275 280 285 
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Leu Tyr Ala Arg Gly Cys Tyr Ser Ala Arg Gly Cys Tyr Ser Pro Arg 

290 295 300 

Ala Leu Ala Gly Leu Tyr Thr Arg Pro Ala Arg Gly Gly Leu Tyr Ala 
305 310 315 320 

Ser Pro Thr His Arg Cys Tyr Ser Gly Leu Asn 
325 330 

(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Thr Glu His Ala Tyr Arg Pro Gly Arg Arg Val Cys Ala Val Arg Ala 

1 5 10 15 

His Gly Asp Pro Val Ser Glu Ser Phe Val Gin Arg Val Tyr Gin Pro 

20 25 30 

Phe Leu Thr Thr Cys Asp Gly His Arg Ala Cys Ser Thr Tyr Arg Thr 

35 40 45 

He Tyr Arg Thr Ala Tyr Arg Arg Ser Pro Gly Leu Ala Pro Ala Arg 

50 55 60 

Pro Arg Tyr Ala Cys Cys Pro Gly Trp Lys Arg Thr Ser Gly Leu Pro 
65 70 75 80 

Gly Ala Cys Gly Ala Ala He Cys Gin Pro Pro Cys Arg Asn Gly Gly 

85 90 95 

Ser Cys Val Gin Pro Gly Arg Cys Arg Cys Pro Ala Gly Trp Arg Gly 

100 105 no 

Asp Thr Cys Gin Ser Asp Val Asp Glu Cys Ser Ala Arg Arg Gly Gly 

115 120 125 

Cys Pro Gin Arg Cys Val Asn Thr Ala Gly Ser Tyr Trp Cys Gin Cys 

130 135 140 

Trp Glu Gly His Ser Leu Ser Ala Asp Gly Thr Leu Cys Val 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 



WO 98/57983 



PCT/US98/12763 



57 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ala He Cys Gin Pro Pro Cys Arg Asn Gly Gly Ser Cys Val Gin Pro 

15 10 15 

Gly Arg Cys Arg Cys Pro Ala Gly Trp Arg Gly Asp Thr Cys Gin Ser 

20 25 30 

Asp Val Asp Glu Cys Ser Ala Arg Arg Gly Gly Cys Pro Gin Arg Cys 

35 40 45 

Val Asn Thr Ala Gly Ser Tyr Trp Cys Gin Cys Trp Glu Gly His Ser 

50 55 60 

Leu Ser Ala Asp Gly Thr Leu Cys Val 
65 70 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Ala He 


Cys Gin 


Pro Pro Cys 


Arg Asn 


Gly 


Gly Ser Cys Val Gin Pro 


1 




5 




10 


15 


Gly Arg 


Cys Arg 


Cys Pro Ala 


Gly Trp 


Arg 


Gly Asp Thr Cys Gin Ser 




20 




25 




30 


Asp Val 


Asp Glu 


Cys Ser Ala 


Arg Arg 


Gly 


Gly Cys Pro Gin Arg Cys 




35 




40 




45 


Val Asn 


Thr Ala 


Gly Ser Tyr 


Trp Cys 


Gin 


Cys Trp Glu Gly His Ser 


50 




55 






60 


Leu Ser 


Ala Asp 


Gly Thr Leu 


Cys Val 


Pro 


Lys Gly Gly Pro Pro Arg 


65 




70 






75 80 


Val Ala 


Pro Asn 


Pro Thr Gly 


Val Asp 


Ser 


Ala Met Lys Glu Glu Val 






85 




90 


95 


Gin Arg 


Leu Gin 


Ser Arg Val 


Asp Leu 


Leu 


Glu Glu Lys Leu Gin Leu 




100 




105 




110 
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Val Leu Ala Pro Leu His Ser Leu Ala Ser Gin Ala Leu Glu His Gly 

115 120 125 

Leu Pro Asp Pro Gly Ser Leu Leu Val His Ser Phe Gin Gin Leu Gly 

130 135 140 

Arg He Asp Ser Leu Ser Glu Gin lie Ser Phe Leu Glu Glu Gin Leu 
145 150 155 160 

Gly Ser Cys Ser Cys Lys Lys Asp Ser 
165 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Thr Glu His 


Ala 


Tyr 


Arg 


Pro 


Gly Arg 


Arg 


Val 


Cys 


Ala Val Arg Ala 


1 




5 








10 






15 


His Gly Asp 


Pro 


Val 


Ser 


Glu 


Ser Phe 


Val 


Gin 


Arg 


Val Tyr Gin Pro 




20 








25 








30 


Phe Leu Thr 


Thr 


Cys 


Asp 


Gly 


His Arg 


Ala 


Cys 


Ser 


Thr Tyr Arg Thr 


35 










40 








45 


He Tyr Arg 


Thr 


Ala 


Tyr 


Arg 


Arg Ser 


Pro 


Gly 


Leu 


Ala Pro Ala Arg 


50 








55 








60 




Pro Arg Tyr 


Ala 


Cys 


Cys 


Pro 


Gly Trp 


Lys 


Arg 


Thr 


Ser Gly Leu Pro 


65 






70 








75 




80 


Gly Ala Cys 


Gly 


Ala 


Pro 


Lys 


Gly Gly 


Pro 


Pro 


Arg 


Val Ala Pro Asn 






85 








90 






95 


Pro Thr Gly 


Val 


Asp 


Ser 


Ala 


Met Lys 


Glu 


Glu 


Val 


Gin Arg Leu Gin 




100 








105 








110 


Ser Arg Val 


Asp 


Leu 


Leu 


Glu 


Glu Lys 


Leu 


Gin 


Leu 


Val Leu Ala Pro 


115 










120 








125 


Leu His Ser 


Leu 


Ala 


Ser 


Gin 


Ala Leu 


Glu 


His 


Gly 


Leu Pro Asp Pro 


130 








135 








140 




Gly Ser Leu 


Leu 


Val 


His 


Ser 


Phe Gin 


Gin 


Leu 


Gly 


Arg He Asp Ser 


145 






150 








155 




160 


Leu Ser Glu 


Gin 


He 


Ser 


Phe 


Leu Glu 


Glu 


Gin 


Leu 


Gly Ser Cys Ser 






165 








170 






175 


Cys Lys Lys 


Asp 


Ser 

















180 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Gly Ser Arg Ala Glu Leu Cys Thr Leu Leu Gly Gly Phe Ser Phe 

1 5 10 15 

Leu Leu Leu Leu He Pro Gly Glu Gly Ala Lys Gly Gly Ser Leu Arq 

20 25 30 

Glu Ser Gin Gly Val Cys Ser Lys Gin Thr Leu Val Val Pro Leu His 

35 40 45 

Tyr Asn Glu Ser Tyr Ser Gin Pro Val Tyr Lys Pro Tyr Leu Thr Leu 

50 55 60 

Cys Ala Gly Arg Arg He Cys Ser Thr Tyr Arg Thr Met Tyr Arg Val 
65 70 75 80 

Met Trp Arg Glu Val Arg Arg Glu Val Gin Gin Thr His Ala Val Cys 

85 90 95 

Cys Gin Gly Trp Lys Lys Arg His Pro Gly Ala Leu Thr Cys Glu Ala 

100 105 no 

He Cys Ala Lys Pro Cys Leu Asn Gly Gly Val Cys Val Arg Pro Asp 

115 120 125 

Gin Cys Glu Cys Ala Pro Gly Trp Gly Gly Lys His Cys His Val Asp 

130 135 140 

Val Asp Glu Cys Arg Thr Ser lie Thr Leu Cys Ser His His Cys Phe 
145 150 155 160 

Asn Thr Ala Gly Ser Phe Thr Cys Gly Cys Pro His Asp Leu Val Leu 

165 170 175 

Gly Val Asp Gly Arg Thr Cys Met Glu Gly Ser Pro Glu Pro Pro Thr 

180 185 190 

Ser Ala Ser He Leu Ser Val Ala Val Arg Glu Ala Glu Lys Asp Glu 

195 200 205 

Arg Ala Leu Lys Gin Glu He His Glu Leu Arg Gly Arg Leu Glu Arq 

210 215 220 

Leu Glu Gin Trp Ala Gly Gin Ala Gly Ala Trp Val Arg Ala Val Leu 
225 230 235 240 

Pro Val Pro Pro Glu Glu Leu Gin Pro Glu Gin Val Ala Glu Leu Trp 
245 250 255 
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Gly Arg Gly Asp Arg He Glu Ser Leu Ser Asp Gin Val Leu Leu Leu 

260 265 270 

Glu Glu Arg Leu Gly Ala Cys Ser Cys Glu Asp Asn Ser Leu Gly Leu 

275 280 285 

Gly Val Asn His Arg 
290 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 261. . .1094 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTAGGGCTCT GCCGGGACCT GGGTCTTCCC TCTCCTGGAG CTGCAGAGGC CAGAAGTTCA 60 

GTGGTGAGGG GTCCAAGGAG AGTCCGGGGA GACCAGGGAG GCTCTGTCCA TCCCCTGTCC 120 

CTGTCCCTGT GGGAAGCCCC CGGCAGCAGC AAGACGCTGG CTGTTCCACC TGCCCACAAG 180 

AACAGCCACC ACCAGTACCC AGGGGATGAC AAGCGGCCGG ACCACAGGCC ACAAAAAGAA 240 

GAAGGCTACC CCACTTACAG ATG CAG ACC ATG TGG GGC TCC GGA GAA CTG 290 

Met Gin Thr Met Trp Gly Ser Gly Glu Leu 
1 5 10 

CTT GTA GCA TGG TTT CTA GTG TTG GCA GCA GAT GGT ACT ACT GAG CAT 338 
Leu Val Ala Trp Phe Leu Val Leu Ala Ala Asp Gly Thr Thr Glu His 
15 20 25 

GTC TAC AGA CCC AGC CGT AGA GTG TGT ACT GTG GGG ATT TCC GGA GGT 386 
Val Tyr Arg Pro Ser Arg Arg Val Cys Thr Val Gly He Ser Gly Gly 
30 35 40 

TCC ATC TCG GAG ACC TTT GTG CAG CGT GTA TAC CAG CCT TAC CTC ACC 434 
Ser He Ser Glu Thr Phe Val Gin Arg Val Tyr Gin Pro Tyr Leu Thr 
45 50 55 
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ACT TGC GAC GGA CAC AGA GCC TGC AGC ACC TAC CGA ACC ATC TAC CGG 482 
Thr Cys Asp Gly His Arg Ala Cys Ser Thr Tyr Arg Thr He Tyr Arg 
60 65 70 

ACT GCC TAT CGC CGT AGC CCT GGG GTG ACT CCC GCA AGG CCT CGC TAT 530 
Thr Ala Tyr Arg Arg Ser Pro Gly Val Thr Pro Ala Arg Pro Arg Tyr 
75 80 85 90 

GCT TGC TGC CCT GGT TGG AAG AGG ACC AGT GGG CTC CCT GGG GCT TGT 578 
Ala Cys Cys Pro Gly Trp Lys Arg Thr Ser Gly Leu Pro Gly Ala Cys 
95 100 105 

GGA GCA GCA ATA TGC CAG CCT CCA TGT GGG AAT GGA GGG AGT TGC ATC 626 
Gly Ala Ala He Cys Gin Pro Pro Cys Gly Asn Gly Gly Ser Cys He 
110 115 120 

CGC CCA GGA CAC TGC CGC TGC CCT GTG GGA TGG CAG GGA GAT ACT TGC 674 
Arg Pro Gly His Cys Arg Cys Pro Val Gly Trp Gin Gly Asp Thr Cys 
125 130 135 

CAG ACA GAT GTT GAT GAA TGC AGT ACA GGA GAG GCC AGT TGT CCC CAG 722 
Gin Thr Asp Val Asp Glu Cys Ser Thr Gly Glu Ala Ser Cys Pro Gin 
140 145 150 

CGC TGT GTC AAT ACT GTG GGA AGT TAC TGG TGC CAG GGA TGG GAG GGA 770 
Arg Cys Val Asn Thr Val Gly Ser Tyr Trp Cys Gin Gly Trp Glu Gly 
155 160 165 170 

CAA AGC CCA TCT GCA GAT GGG ACG CGC TGC CTG TCT AAG GAG GGG CCC 818 
Gin Ser Pro Ser Ala Asp Gly Thr Arg Cys Leu Ser Lys Glu Gly Pro 
175 180 185 

TCC CCG GTG GCC CCA AAC CCC ACA GCA GGA GTG GAC AGC ATG GCG AGA 866 
Ser Pro Val Ala Pro Asn Pro Thr Ala Gly Val Asp Ser Met Ala Arg 
190 195 200 

GAG GAG GTG TAC AGG CTG CAG GCT CGG GTT GAT GTG CTA GAA CAG AAA 914 
Glu Glu Val Tyr Arg Leu Gin Ala Arg Val Asp Val Leu Glu Gin Lys 
205 210 215 

CTG CAG TTG GTG CTG GCC CCA CTG CAC AGC CTG GCC TCT CGG TCC ACA 962 
Leu Gin Leu Val Leu Ala Pro Leu His Ser Leu Ala Ser Arg Ser Thr 
220 225 230 
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GAG CAT GGG CTA CAA GAT CCT GGC AGC CTG CTG GCC CAT TCC TTC CAG 1010 

Glu His Gly Leu Gin Asp Pro Gly Ser Leu Leu Ala His Ser Phe Gin 
235 240 245 250 

CAG CTG GAC CGA ATT GAT TCA CTG AGT GAG CAG GTG TCC TTC TTG GAG 1"058 
Gin Leu Asp Arg He Asp Ser Leu Ser Glu Gin Val Ser Phe Leu Glu 
255 260 265 

GAA CAT CTG GGG TCC TGC TCC TGC AAA AAA GAT CTG TGATAACCTC TCACCA 1110 
Glu His Leu Gly Ser Cys Ser Cys Lys Lys Asp Leu 
270 275 

CCCAGGCTGG ATAGAGCAGT CATCCCTAGA TCCCTTGTAG CCAGAGTTCA GGCGCTGTCT 1170 

GGTGGTGCCT ATGAGCAGAA GGCCCTGCCT CATTGTCCCT CTTTCTTAGG AGGTTCCTAG 1230 

GACTTGGGCA TGGGGAGTGG GGTCTTGTGT GACTCTTCAG TGGGGCTCCC TGTCTAAGTG 1290 

GTAAGGTGGG GATTGTCTCC ATCTTTGTCA TAATAAAGCT GAGACTTGA 1339 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 



Met 


Gin 


Thr 


Met 


Trp 


Gly 


Ser 


Gly 


Glu 


Leu 


Leu 


Val 


Ala 


Trp Phe Leu 


1 








5 










10 








15 


Val 


Leu 


Ala 


Ala 


Asp 


Gly 


Thr 


Thr 


Glu 


His 


Val 


Tyr 


Arg 


Pro Ser Arg 








20 










25 










30 


Arg 


Val 


Cys 


Thr 


Val 


Gly 


He 


Ser 


Gly 


Gly 


Ser 


He 


Ser 


Glu Thr Phe 






35 










40 










45 




Val 


Gin 


Arg 


Val 


Tyr 


Gin 


Pro 


Tyr 


Leu 


Thr 


Thr 


Cys 


Asp 


Gly His Arg 




50 










55 










60 




Ala 


Cys 


Ser 


Thr 


Tyr 


Arg 


Thr 


He 


Tyr 


Arg 


Thr 


Ala 


Tyr 


Arg Arg Ser 


65 










70 










75 






80 


Pro 


Gly 


Val 


Thr 


Pro 


Ala 


Arg 


Pro 


Arg 


Tyr 


Ala 


Cys 


Cys 


Pro Gly Trp 










85 










90 








95 


Lys 


Arg 


Thr 


Ser 


Gly 


Leu 


Pro 


Gly 


Ala 


Cys 


Gly 


Ala 


Ala 


He Cys Gin 



100 105 110 
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Pro Pro Cys Gly Asn GTy Gly Ser Cys lie Arg Pro Gly His Cys Arg 

115 120 125 

Cys Pro Val Gly Trp Gin Gly Asp Thr Cys Gin Thr Asp Val Asp Glu 

130 135 140 

Cys Ser Thr Gly Glu Ala Ser Cys Pro Gin Arg Cys Val Asn Thr Val 
145 150 155 160 

Gly Ser Tyr Trp Cys Gin Gly Trp Glu Gly Gin Ser Pro Ser Ala Asp 

165 170 175 

Gly Thr Arg Cys Leu Ser Lys Glu Gly Pro Ser Pro Val Ala Pro Asn 

180 185 190 

Pro Thr Ala Gly Val Asp Ser Met Ala Arg Glu Glu Val Tyr Arg Leu 

195 200 205 

Gin Ala Arg Val Asp Val Leu Glu Gin Lys Leu Gin Leu Val Leu Ala 

210 215 220 

Pro Leu His Ser Leu Ala Ser Arg Ser Thr Glu His Gly Leu Gin Asp 
225 230 235 240 

Pro Gly Ser Leu Leu Ala His Ser Phe Gin Gin Leu Asp Arg He Asp 

245 250 255 • 

Ser Leu Ser Glu Gin Val Ser Phe Leu Glu Glu His Leu Gly Ser Cys 

260 265 270 

Ser Cys Lys Lys Asp Leu 
275 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Thr Cys Asp Gly His Arg Ala Cys Ser Thr Tyr Arg Thr He Tyr Arg 

15 10 15 

Thr Ala Tyr Arg Arg Ser Pro Gly Leu Ala Pro Ala Arg 
20 25 

(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 



MSOOCIO: <WO 9857983A2J_ 



WO 98/57983 



64 



PCT/US98/12763 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gin Pro Gly Arg Cys Arg Cys Pro Ala Gly Trp Arg Gly Asp Thr Cys 

1 5 10 15 

Gin Ser Asp Val Asp Glu Cys Ser Ala Arg Arg Gly Gly Cys Pro Gin 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Cys Val Pro Lys Gly Gly Pro Pro Arg Val Ala Pro Asn Pro Thr Gly 

1 5 10 15 

Val Asp Ser Ala Met Lys Glu Glu Val Gin Arg Leu Gin Ser Arg Val 

20 25 30 

Asp Leu Leu Glu Glu 
35 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Gin Gin Leu Gly Arg He Asp Ser Leu Ser Glu Gin He Ser Phe Leu 

1 5 10 15 

Glu Glu Gin Leu Gly Ser Cys Ser Cys Lys Lys Asp Ser 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



i nr 


b 1 U 


n 1 S 


Va 1 


iyr Arg 


rro oer Arg Arg 


va i 


Cys Thr 


va i b iy lie 


i 








c 
O 


1U 






i k 
lo 


Ser 


b iy 


b iy 


Ser 


Tin Cqk* 

1 1 1: jci 


Clii Thr Dho W^l 
u III 1 ill rut? Vd 1 


ri n 


Am 

Ml y Vd 1 


iyr b i n k ro 


















^n 


Iyr 


Leu 


i nr 


i nr 


uys ASp 


P~I w U-i c A r»ri Al ^ 

b iy mis Arg a i a 


tys 


oer inr 


iyr Arg inr 






OJ 






/in 








Tl o 


iyr 


A rn 
Mi y 


Thr 

i nr 


Mia i y r 


Mr y Mi y jci ri u 


b iy 


\/^l Thr 

vd i i nr 


rro Aia Arg 




ou 












DU 




Pro 
r I U 


Mr y 


Tw r 

iyr 


Al a 
M I a 


wo Lyb 


Pro ril \/ Trn i \/c 
n u V3 i y \ \ \j i_y b 


mi y 


Thr ^or 

i III Ofcf r 


O 1 y LcU rl U 


OO 








70 
/ u 




/ o 




ou 


fil V 
vj i y 


Ala 




fil V 

o i y 


Ala Ala 

M 1 a r\ 1 CI 


Tip fv<; fil n Prn 

1 I tr v_> jr o Vj I 1 1 n U 


Pro 
r i vj 




A<;n fil v fil \/ 

no II vj 1 ^ VJ 1 y 












an 






yo 


Ser 


Cys 


He 


Arg 


Pro Gly 


His Cys Arg Cys 


Pro 


Val Gly 


Trp Gin Gly 








100 




105 






110 


Asp 


Thr 


Cys 


Gin 


Thr Asp 


Val Asp Glu Cys 


Ser 


Thr Gly 


Glu Ala Ser 






115 






120 




125 




Cys 


Pro 


Gin 


Arg 


Cys Val 


Asn Thr Val Gly 


Ser 


Tyr Trp 


Cys Gin Gly 




130 








135 




140 




Trp 


Glu 


Gly 


Gin 


Ser Pro 


Ser Ala Asp Gly 


Thr 


Arg Cys 


Leu Ser Lys 


145 








150 




155 




160 


Glu 


Gly 


Pro 


Ser 


Pro Val 


Ala Pro Asn Pro 


Thr 


Ala Gly 


Val Asp Ser 










165 


170 






175 


Met 


Ala 


Arg 


Glu 


Glu Val 


Tyr Arg Leu Gin 


Ala 


Arg Val 


Asp Val Leu 








180 




185 






190 


Glu 


Gin 


Lys 


Leu 


Gin Leu 


Val Leu Ala Pro 


Leu 


His Ser 


Leu Ala Ser 






195 






200 




205 




Arg 


Ser 


Thr 


Glu 


His Gly 


Leu Gin Asp Pro 


Gly 


Ser Leu 


Leu Ala His 




210 








215 




220 




Ser 


Phe 


Gin 


Gin 


Leu Asp 


Arg lie Asp Ser 


Leu 


Ser Glu 


Gin Val Ser 


225 








230 




235 




240 


Phe 


Leu 


Glu 


Glu. 


His Leu 


Gly Ser Cys Ser 


Cys 


Lys Lys 


Asp Leu 










245 


250 






255 
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CLAIMS 
We claim: 

1. An isolated polynucleotide which encodes a' 
mammalian Zneul polypeptide wherein said polynucleotide 
encodes a polypeptide selected from the group SEQ ID NOs: 2- 
3,8, 9, 11-16, and 19-24 or a polypeptide which is at least 
90% identical to the polypeptides of said group and which 
retain the activity of said polypeptides. 

2. An isolated polynucleotide which encodes a 
peptide or polypeptide having at least 15 amino acid residues 
comprised of an epitope-bearing portion of a polypeptide of 
SEQ ID NOs: 2-3,8, 9, 11-16, and 19-24 or a polypeptide which 
is at least 90% identical to said polypeptides. 

3. The polynucleotide of claim 2 wherein the peptide 
or polypeptide is fused to a carrier polypeptide or other 
carrier molecule . 

4. An expression vector comprising the following 
operably linked elements: 

a transcription promoter; 

a DNA segment which encodes a Zneul polypeptide or 
a peptide or polypeptide which contains an epitope-bearing 
region of a Zneul polypeptide; and 
a transcription terminator. 

5. An expression vector comprising the following 
operably linked elements: 

(a) a transcription promoter; 

(b) a DNA segment encoding a chimeric polypeptide, 
wherein said chimeric polypeptide consists essentially of a 
first portion and a second portion joined by a peptide bond, 
said first portion being comprised of a mammalian polypeptide, 
said polypeptide being the amino acid sequences of SEQ ID NOs: 
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2-3,8, 9, 11-16, and 19-24 or a polypeptide which is at least 
90% identical to said amino acid sequences and said second 
portion being a second polypeptide or protein, 
(c) a transcription terminator. 

6. An isolated Zneul polypeptide selected from the 
group of amino acid sequences consisting of SEQ ID NOs : 2-3,8, 
9, 11-16, and 19-24 or a polypeptide which is at least 90% 
identical to said polypeptides. 

7. An isolated peptide or polypeptide having at 
least 15 amino acid residues comprised of an epitope -bearing 
portion of a polypeptide of SEQ ID NOs : 2-3,8, 9, 11-16, and 
19-24 or is at least 90% identical to said epitope bearing 
portion . 

8. The isolated peptide or polypeptide of claim 7 
wherein the epitope-bearing portion is selected from the group 
of amino acid sequence consisting of SEQ ID NOs : 20-23 or a 
peptide or polypeptide which is at least 90% identical to said 
epitope bearing portion. 

9. An antibody, antibody fragment or single-chain 
antibody that specifically binds to a mammalian polypeptide, 
said polypeptide being defined by the amino acid sequences of 
SEQ ID NOs : 2-3,8, 9, 11-16, and 19-24. 

10. The antibody, antibody fragment or single-chain 
antibody of claim 9 wherein said antibody, antibody fragment 
or single-chain antibody is humanized. 

11. A method for producing an antibody which binds 
to a peptide or polypeptide defined by SEQ ID NOs: 2-3,8, 9, 
11-16, and 19-24 or to a peptide or polypeptide which is at 
least 90% identical to said peptide or polypeptide comprising" 
inoculating an animal with said peptide or polypeptide or with 
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a nucleic acid which encodes said peptide or polypeptide, 
wherein said animal produces antibodies to said peptide or 
polypeptide; and 

isolating said antibody. 

12. An anti- idiotypic antibody, ant i- idiotypic 
antibody fragment or anti-idiotypic single-chain antibody 
which binds to an antibody, an antibody fragment or single- 
chain antibody of peptide or polypeptide defined by SEQ ID 
NOs : 2-3,8, 9, 11-16, and 19-24 or to a peptide or polypeptide 
which is at least 90% identical to said peptide or 
polypeptide . 

13. The antibodies of claims 9-12 wherein said 
antibodies are either polyclonal or monoclonal. 
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